Automated information extraction of key trial design elements from clinical trial publications

Author	De Bruijn, Berry¹; Carini, Simona; Kiritchenko, Svetlana¹; Martin, Joel¹; Sim, Ida
Affiliation	National Research Council Canada. Information and Communication Technologies
Format	Text, Article
Abstract	Clinical trials are one of the most valuable sources of scientific evidence for improving the practice of medicine. The Trial Bank project aims to improve structured access to trial findings by including formalized trial information into a knowledge base. Manually extracting trial information from published articles is costly, but automated information extraction techniques can assist. The current study highlights a single architecture to extract a wide array of information elements from full-text publications of randomized clinical trials (RCTs). This architecture combines a text classifier with a weak regular expression matcher. We tested this two-stage architecture on 88 RCT reports from 5 leading medical journals, extracting 23 elements of key trial information such as eligibility rules, sample size, intervention, and outcome names. Results prove this to be a promising avenue to help critical appraisers, systematic reviewers, and curators quickly identify key information elements in published RCT articles.
Publication date	2008-11-06
Publisher	AMIA
In	AMIA Annual Symposium Proceedings: 141–145.
Language	English
Peer reviewed	Yes
NPARC number	23001918
Record identifier	4189aaa7-0199-42d3-964c-4e03e6964810
Record created	2017-05-24
Record modified	2020-04-15
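
The abstract describes a two-stage architecture: a text classifier followed by a weak regular expression matcher. The record gives no implementation details, so the following is only a minimal sketch of that general idea for one element (sample size). The keyword-weight scorer is a stand-in for a trained classifier, and the cue words, weights, pattern, and function names are all illustrative assumptions rather than the authors' method.

```python
import re

# Stage 1 stand-in (assumption): a keyword-weight scorer used in place of a
# trained text classifier. Cue words and weights are illustrative only.
SAMPLE_SIZE_CUES = {
    "randomized": 2.0,
    "randomised": 2.0,
    "enrolled": 1.5,
    "assigned": 1.0,
    "patients": 1.0,
    "participants": 1.0,
}

# Stage 2 (assumption): a deliberately permissive ("weak") pattern that accepts
# any number followed by a word for trial subjects.
SAMPLE_SIZE_PATTERN = re.compile(
    r"\b(\d{1,3}(?:,\d{3})+|\d+)\s+(?:patients|participants|subjects)\b",
    re.IGNORECASE,
)


def split_sentences(text):
    """Naive sentence splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def score_sentence(sentence):
    """Sum the cue weights of the lowercase word tokens in the sentence."""
    tokens = re.findall(r"[a-z]+", sentence.lower())
    return sum(SAMPLE_SIZE_CUES.get(token, 0.0) for token in tokens)


def extract_sample_size(text, top_k=1):
    """Rank sentences by score, then run the weak pattern on the top_k only.

    Returns (sample_size, sentence), or None if nothing matches.
    """
    ranked = sorted(split_sentences(text), key=score_sentence, reverse=True)
    for sentence in ranked[:top_k]:
        if score_sentence(sentence) <= 0:
            continue  # no cue words at all: do not trust the pattern here
        match = SAMPLE_SIZE_PATTERN.search(sentence)
        if match:
            return int(match.group(1).replace(",", "")), sentence
    return None


if __name__ == "__main__":
    report = (
        "Blood pressure was measured in 12 patients during a pilot phase. "
        "A total of 1,250 patients were randomized and assigned to treatment "
        "or placebo. Follow-up lasted 24 months."
    )
    print(extract_sample_size(report))
```

In this sketch the pattern alone would pick up the first number followed by "patients" anywhere in the text. Restricting it to the highest-scoring sentence is what allows such a loose pattern to be used.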

Date modified: 2026-05-30