Benchmarking the output of large language models against human expert-curated biomedical knowledge graphs

Negin Sadat Babaiha a b, Sathvik Guru Rao a, Jürgen Klein a, Bruce Schultz a, Marc Jacobs a, Martin Hofmann-Apitius:

Biomedical knowledge graphs (KGs) hold valuable information regarding biomedical entities such as genes, diseases, biological processes, and drugs. KGs have been successfully employed in challenging biomedical areas such as the identification of pathophysiology mechanisms or drug repurposing. The creation of high-quality KGs typically requires labor-intensive multi-database integration or substantial human expert curation, both of which take time and contribute to the workload of data processing and annotation. Therefore, the use of automatic systems for KG building and maintenance is a prerequisite for the wide uptake and utilization of KGs.