Technology ID

Method and System of Building Hospital-Scale Medical Image Database

Lead Inventor
Wang, Xiaosong (CC)
Peng, Yifan (National Library of Medicine, NIH)
Lu, Le (CC)
Lu, Zhiyong (NLM)
Summers, Ronald (CC)
Software / Apps
Development Stages
Lead IC

Developing computer systems that can recognize and locate image features associated with disease is a challenge for developing fully-automated and high precision computer assisted diagnostics. Joint learning of language tasks in association with vision tasks (association of image features with text annotation) adds an additional level of challenge.  Furthermore, scaling-up approaches from small to large datasets presents additional issues, particularly related to medical images. In this case, identifying such features requires specialized skill for even a human and the text descriptions from trained physicians may be variable, complex, and abstract.  The application of deep learning to medical image feature detection in association with language recognition may aid in the development of precision automated computer-aided diagnostics for a wide range of disease conditions, based on large scale PACS datasets.  


The technology developed by researchers at the National Institutes of Health Clinical Center (NIHCC), applies natural language processing techniques and deep learning methods to mine PACS images and generate large-scale image databases. Diseases can be detected and spatially-located within the dataset generated by this method. The generation of such datasets is an important step toward utilizing PACS informatics and development of fully-automated high precision computer diagnostic systems.

Competitive Advantages:

  • Ability to create large-scale labeled medical image database

Commercial Applications:

  • Computer Assisted Diagnostics
  • Medical Image Informatics
Licensing Contact:
Fenn, Edward (Tedd)