From The Seeds of Science journal:
This article was originally posted on December 7th, 2021 on Markus’ website.
TL;DR: I worked on biomedical literature search, discovery and recommender web applications for many months and concluded that extracting, structuring or synthesizing "insights" from academic publications (papers) or building knowledge bases from a domain corpus of literature has negligible value in industry.
Close to nothing of what makes science actually work is published as text on the web
Here’s the outline:
The Business of Extracting Knowledge from Academic Publications
Psychoanalysis of a Troubled Industry
My Quixotic Escapades in Building Literature Search and Discovery Tools
Fundamental Issues with Structuring Academic Literature as a Business
Just a Paper, an Idea, an Insight Does Not Get You Innovation
Contextual, Tacit Knowledge is not Digital, not Encoded or just not Machine-Interpretable yet
Experts have well defined, internalized maps of their field
Scientific Publishing comes with Signaling, Status Games, Fraud and often Very Little Information
Non-technical Life Science Labor Is Cheap
The Literature is Implicitly Reified in Public Structured Knowledge Bases
Advanced Interactives and Visualizations are Surprisingly Unsatisfying to Consume
Unlike other Business Software, Domain Knowledge Bases of Research Companies are Maximally Idiosyncratic
Divergent Tasks are Hard to Evaluate and Reason About
Public Penance: My Mistakes, Biases and Self-Deceptions
Onwards
Psychoanalysis of a Troubled Industry....
....MUCH MORE
Last week's visit to Seeds of Science was: