Authors
Wang, Tong
Document Type
Poster Abstract
Publication Date
2014-05-20
Keywords
Bioinformatics
Databases and Information Systems
Library and Information Science
Theory and Algorithms
Translational Medical Research
Abstract
In recent years, evidence-based medicine has become increasingly important in guiding health care practice. Systematic review, the core component of evidence-based medicine, attempts to identify and synthesize all the empirical evidence from online resources such as PubMed to answer a given research question. A clinical researcher usually needs to choose dozens of related articles as references for a systematic review. However, a keyword search in PubMed usually retrieves thousands of articles, and reading each one to find the right ones is time consuming. My work applies text mining and machine learning techniques to screen articles automatically, minimizing the article set without losing any of the right ones. The project approaches the problem at three levels: words, sentences, and articles. Words are analyzed by counting term frequency; sentences are analyzed by parsing syntactic structure and by semantic analysis; articles are analyzed through general features such as the author and the number of articles that cite them. The data sets are also imbalanced, since the ‘right’ articles make up only a very small part, so many challenges remain to be addressed. My work currently focuses on the word level, trying different feature selection methods and classifiers to improve performance.
DOI
10.13028/3mm2-0m64
Permanent Link to this Item
http://hdl.handle.net/20.500.14038/27894
Notes
Abstract of poster presented at the 2014 UMass Center for Clinical and Translational Science Research Retreat, held on May 20, 2014 at the University of Massachusetts Medical School, Worcester, Mass.
Rights
Copyright the Author(s)
Distribution License
http://creativecommons.org/licenses/by-nc-sa/3.0/