Building OCR/NER Test Collections

March 16, 2020

Named entity recognition (NER) identifies spans of text that contain names. Over the past two decades, many researchers have reported results of NER on text produced by optical character recognition (OCR). Unfortunately, the test collections that support this research are annotated with named entities after OCR has been run. This means that the collection […]

HLTCOE a Top Performer at VoxSRC

January 29, 2020

JHU HLTCOE was a top performer in a recent open speaker recognition challenge called VoxSRC, finishing in the top two of more than fifty entries from the international research community. The challenge, hosted by the University of Oxford, was based on the open-source VoxCeleb speech corpus captured from public celebrity videos with automatic speaker labeling […]

Processing videos by combining visual and audio cues

December 12, 2019

Videos carry information in both the visual and audio domains, so video processing techniques should use both for more effective solutions. The HLTCOE has been researching this strategy since 2017, when recognizing individuals in videos using both voice and face was a topic at the SCALE summer workshop. The work resulting […]

Johns Hopkins University

Human Language Technology Center of Excellence

810 Wyman Park Drive, Baltimore, MD 21211

410-516-4800
