Poster Presentations

Start Date

20-5-2014 12:30 PM

Description

In recent years, evidence-based medicine has become increasingly important in guiding health care practices. Systematic review, which is the core component of evidence-based medicine, attempts to identify and synthesize all the empirical evidence from online resources such as pubmed to answer a given research question. Usually a clinical researcher needs to choose dozens of related articles as references to work on systematic review. However, there are usually thousand of articles retried from pubmed after keywords searching, it is time consuming to read each of the articles to find the right ones. My work is to apply text mining and machine learning techniques to screen articles automatically, to minimize the articles set without losing any right ones. This project is processing from three aspects: words, sentences and articles. Words are analyzed by counting term frequency, sentences are by parsing syntactic structures and semantic analyzing, articles are by analyzing general features like author, how many articles reference it. And the data sets are imbalance since the ‘right’ articles are only a very small part. So there are a lot of challenges to be addressed. My work is currently focusing on words level, to try different feature selection methods and classifiers to improve the performance.

Comments

Abstract of poster presented at the 2014 UMass Center for Clinical and Translational Science Research Retreat, held on May 20, 2014 at the University of Massachusetts Medical School, Worcester, Mass.

Creative Commons License

Creative Commons Attribution-Noncommercial-Share Alike 3.0 License
This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 License.

 
May 20th, 12:30 PM

Automatic Article Screening in Systematic Review

In recent years, evidence-based medicine has become increasingly important in guiding health care practices. Systematic review, which is the core component of evidence-based medicine, attempts to identify and synthesize all the empirical evidence from online resources such as pubmed to answer a given research question. Usually a clinical researcher needs to choose dozens of related articles as references to work on systematic review. However, there are usually thousand of articles retried from pubmed after keywords searching, it is time consuming to read each of the articles to find the right ones. My work is to apply text mining and machine learning techniques to screen articles automatically, to minimize the articles set without losing any right ones. This project is processing from three aspects: words, sentences and articles. Words are analyzed by counting term frequency, sentences are by parsing syntactic structures and semantic analyzing, articles are by analyzing general features like author, how many articles reference it. And the data sets are imbalance since the ‘right’ articles are only a very small part. So there are a lot of challenges to be addressed. My work is currently focusing on words level, to try different feature selection methods and classifiers to improve the performance.

 

To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.