<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Human Language Technology Center of Excellence</title>
	<atom:link href="http://hltcoe.jhu.edu/feed/" rel="self" type="application/rss+xml" />
	<link>http://hltcoe.jhu.edu</link>
	<description></description>
	<lastBuildDate>Wed, 22 Feb 2012 15:44:38 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='hltcoe.jhu.edu' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>Human Language Technology Center of Excellence</title>
		<link>http://hltcoe.jhu.edu</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://hltcoe.jhu.edu/osd.xml" title="Human Language Technology Center of Excellence" />
	<atom:link rel='hub' href='http://hltcoe.jhu.edu/?pushpress=hub'/>
		<item>
		<title>Wednesday, March 7, 2012: Matt Hoffman: Variational Bayesian Methods for Unsupervised Latent Factor Models of Text and Audio</title>
		<link>http://hltcoe.jhu.edu/2012/02/22/wednesday-march-7-2012-matt-hoffman-variational-bayesian-methods-for-unsupervised-latent-factor-models-of-text-and-audio/</link>
		<comments>http://hltcoe.jhu.edu/2012/02/22/wednesday-march-7-2012-matt-hoffman-variational-bayesian-methods-for-unsupervised-latent-factor-models-of-text-and-audio/#comments</comments>
		<pubDate>Wed, 22 Feb 2012 15:44:34 +0000</pubDate>
		<dc:creator>kdaught2</dc:creator>
				<category><![CDATA[Talks]]></category>

		<guid isPermaLink="false">http://hltcoe.jhu.edu/?p=364</guid>
		<description><![CDATA[Matt Hoffman, Columbia University Variational Bayesian Methods for Unsupervised Latent Factor Models of Text and Audio In this talk, I will discuss variational strategies for fitting two Bayesian models that explain high-dimensional media data in terms of sets of latent factors. The first model, Latent Dirichlet Allocation (LDA), is a popular model of text corpora [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=hltcoe.jhu.edu&amp;blog=26761117&amp;post=364&amp;subd=hltcoe&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Matt Hoffman, Columbia University</p>
<p>Variational Bayesian Methods for Unsupervised Latent Factor Models of Text and Audio</p>
<p>In this talk, I will discuss variational strategies for fitting two Bayesian models that explain high-dimensional media data in terms of sets of latent factors.</p>
<p>The first model, Latent Dirichlet Allocation (LDA), is a popular model of text corpora that learns to represent documents as mixtures of latent &#8220;topic&#8221; distributions. We develop an online variational Bayes (VB) algorithm for LDA. Online LDA is based on online stochastic optimization with a natural gradient step, which we show converges to a local optimum of the VB objective function. It can handily analyze massive document collections, including those arriving in a stream. We study the performance of online LDA in several ways, including by fitting a 100-topic topic model to 3.3M articles from Wikipedia in a single pass. We demonstrate that online LDA finds topic models as good as or better than those found with batch VB, and in a fraction of the time.</p>
<p>The second model, Gamma Process Non negative Matrix Factorization (GaP-NMF), is a new Bayesian nonparametric model of audio spectrograms that addresses the problem of latent source discovery and separation in audio recordings. GaP-NMF allows us to discover what sounds (e.g. bass drums, guitar chords, etc.) are present in a recording and to isolate or suppress individual sources. Crucially, this model is able to decide how many latent sources are necessary to model the data. This feature is particularly valuable in this application, since it is impossible to guess a priori how many sounds will appear in a given recording. Although the GaP-NMF model lacks the conditional conjugacy enjoyed by models such as LDA, we are nonetheless able to efficiently fit it to data using a novel variational algorithm.</p>
<p>Bio:</p>
<p>Matthew D. Hoffman is a postdoctoral researcher in the Department of Statistics at Columbia University. He received his Ph.D. in Computer Science from Princeton University in 2010. His research interests are in machine learning, statistical modeling, audio signal processing, content-based music information retrieval, and the associated computational issues.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/hltcoe.wordpress.com/364/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/hltcoe.wordpress.com/364/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/hltcoe.wordpress.com/364/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/hltcoe.wordpress.com/364/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/hltcoe.wordpress.com/364/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/hltcoe.wordpress.com/364/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/hltcoe.wordpress.com/364/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/hltcoe.wordpress.com/364/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/hltcoe.wordpress.com/364/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/hltcoe.wordpress.com/364/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/hltcoe.wordpress.com/364/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/hltcoe.wordpress.com/364/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/hltcoe.wordpress.com/364/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/hltcoe.wordpress.com/364/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=hltcoe.jhu.edu&amp;blog=26761117&amp;post=364&amp;subd=hltcoe&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://hltcoe.jhu.edu/2012/02/22/wednesday-march-7-2012-matt-hoffman-variational-bayesian-methods-for-unsupervised-latent-factor-models-of-text-and-audio/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/308742bc09c1585d68805c9dd233753e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">kdaught2</media:title>
		</media:content>
	</item>
		<item>
		<title>March 5, 2012: Margaret Mitchell: Generating Descriptions of Visible Objects</title>
		<link>http://hltcoe.jhu.edu/2012/02/21/march-5-2012-margaret-mitchell/</link>
		<comments>http://hltcoe.jhu.edu/2012/02/21/march-5-2012-margaret-mitchell/#comments</comments>
		<pubDate>Tue, 21 Feb 2012 17:19:58 +0000</pubDate>
		<dc:creator>kdaught2</dc:creator>
				<category><![CDATA[Talks]]></category>

		<guid isPermaLink="false">http://hltcoe.jhu.edu/?p=360</guid>
		<description><![CDATA[Margaret Mitchell, University of Aberdeen &#8220;Generating Descriptions of Visible Objects&#8221; Abstract:  What do people describe when they look at objects?  Can we model what they say?  (Why does this matter?)  This talk will characterize what makes up a visual description and define some of the methods necessary to automatically generate such language.  Taking this a [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=hltcoe.jhu.edu&amp;blog=26761117&amp;post=360&amp;subd=hltcoe&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Margaret Mitchell, University of Aberdeen</p>
<p>&#8220;Generating Descriptions of Visible Objects&#8221;</p>
<p>Abstract:  What do people describe when they look at objects?  Can we model what they say?  (Why does this matter?)  This talk will characterize what makes up a visual description and define some of the methods necessary to automatically generate such language.  Taking this a bit further, I describe an end-to-end prototype system that reads in computer vision output and generates natural language descriptions.  Time permitting, I argue that improving visual descriptions can also improve computer vision, and working on the interaction between the two may lead to advances in both computer vision and natural language generation.  My prototype vision-to-language system, largely developed during the Hopkins summer workshop 2011 in collaboration with vision researchers at Stony Brook and language researchers at U. Maryland, is available at:  <a href="http://recognition.cs.stonybrook.edu:8080/%7Emitchema/midge/">http://recognition.cs.stonybrook.edu:8080/~mitchema/midge/</a></p>
<p>&nbsp;</p>
<p>BIO:</p>
<p>Meg Mitchell graduated with a Bachelor&#8217;s in Linguistics from Reed College in 2005.  Since then, she has worked at the Center for Spoken Language Understanding at Oregon Health and Science University, helping to automatically diagnose neurological disorders by analyzing the syntactic and phonetic characteristics of spoken language.  She has also received a Master&#8217;s in Computational Linguistics from the University of Washington and is working towards a Ph.D in Computing Science at the University of Aberdeen.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/hltcoe.wordpress.com/360/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/hltcoe.wordpress.com/360/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/hltcoe.wordpress.com/360/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/hltcoe.wordpress.com/360/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/hltcoe.wordpress.com/360/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/hltcoe.wordpress.com/360/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/hltcoe.wordpress.com/360/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/hltcoe.wordpress.com/360/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/hltcoe.wordpress.com/360/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/hltcoe.wordpress.com/360/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/hltcoe.wordpress.com/360/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/hltcoe.wordpress.com/360/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/hltcoe.wordpress.com/360/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/hltcoe.wordpress.com/360/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=hltcoe.jhu.edu&amp;blog=26761117&amp;post=360&amp;subd=hltcoe&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://hltcoe.jhu.edu/2012/02/21/march-5-2012-margaret-mitchell/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/308742bc09c1585d68805c9dd233753e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">kdaught2</media:title>
		</media:content>
	</item>
		<item>
		<title>Feb. 27, 2012: Kevin Gimpel: Statistical Modeling and Learning for Machine Translation</title>
		<link>http://hltcoe.jhu.edu/2012/02/16/feb-27-2012-statistical-modeling-and-learning-for-machine-translation-kevin-gimpel/</link>
		<comments>http://hltcoe.jhu.edu/2012/02/16/feb-27-2012-statistical-modeling-and-learning-for-machine-translation-kevin-gimpel/#comments</comments>
		<pubDate>Thu, 16 Feb 2012 14:23:36 +0000</pubDate>
		<dc:creator>kdaught2</dc:creator>
				<category><![CDATA[Talks]]></category>

		<guid isPermaLink="false">http://hltcoe.jhu.edu/?p=347</guid>
		<description><![CDATA[Kevin Gimpel , Carnegie Mellon University “Statistical Modeling and Learning for Machine Translation” Recent years have seen a flurry of research in translation modeling for statistical machine translation. Widely-used approaches include (1) models based on flat phrase-to-phrase mappings, and (2) models that use syntactic structure. I will present an approach that combines the strengths of [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=hltcoe.jhu.edu&amp;blog=26761117&amp;post=347&amp;subd=hltcoe&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Kevin Gimpel , Carnegie Mellon University</p>
<p style="text-align:left;" align="center">“Statistical Modeling and Learning for Machine Translation”</p>
<p>Recent years have seen a flurry of research in translation modeling for statistical machine translation. Widely-used approaches include (1) models based on flat phrase-to-phrase mappings, and (2) models that use syntactic structure. I will present an approach that combines the strengths of these two in a single model. Experiments show that it leads to improved translation quality over state-of-the-art systems. When supervised syntactic parsers are not available, I will show that unsupervised parsers can be substituted with minimal loss in translation quality. I will also discuss the problem of tuning model parameters. An abundance of techniques has been developed by the statistical machine learning community, but the machine translation problem is fundamentally different from other structure prediction tasks in key ways. These differences have caused well-known learning algorithms to lose theoretical guarantees when applied to machine translation. I will propose a novel algorithm that does offer theoretical guarantees and shows empirical advantage over state-of-the-art approaches.</p>
<p>BIO:</p>
<p>Kevin Gimpel is a PhD student in the Language Technologies Institute at Carnegie Mellon University where he is advised by Noah Smith. His research focuses on machine translation, with supporting interests in natural language processing and machine learning. He is also interested in new tasks involving emerging data sources, including social media, movie reviews, and restaurant menus. He interned with the machine translation team at Google during summer 2009 and has been a Sandia National Laboratories Excellence in Science and Technology Fellow since 2010.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/hltcoe.wordpress.com/347/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/hltcoe.wordpress.com/347/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/hltcoe.wordpress.com/347/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/hltcoe.wordpress.com/347/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/hltcoe.wordpress.com/347/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/hltcoe.wordpress.com/347/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/hltcoe.wordpress.com/347/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/hltcoe.wordpress.com/347/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/hltcoe.wordpress.com/347/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/hltcoe.wordpress.com/347/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/hltcoe.wordpress.com/347/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/hltcoe.wordpress.com/347/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/hltcoe.wordpress.com/347/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/hltcoe.wordpress.com/347/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=hltcoe.jhu.edu&amp;blog=26761117&amp;post=347&amp;subd=hltcoe&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://hltcoe.jhu.edu/2012/02/16/feb-27-2012-statistical-modeling-and-learning-for-machine-translation-kevin-gimpel/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/308742bc09c1585d68805c9dd233753e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">kdaught2</media:title>
		</media:content>
	</item>
		<item>
		<title>Feb. 20, 2012: Lidan Wang: Learning to Efficiently Rank</title>
		<link>http://hltcoe.jhu.edu/2012/02/09/feb-20-2012-learning-to-efficiently-rank-lidan-wang/</link>
		<comments>http://hltcoe.jhu.edu/2012/02/09/feb-20-2012-learning-to-efficiently-rank-lidan-wang/#comments</comments>
		<pubDate>Thu, 09 Feb 2012 17:07:01 +0000</pubDate>
		<dc:creator>kdaught2</dc:creator>
				<category><![CDATA[Talks]]></category>

		<guid isPermaLink="false">http://hltcoe.jhu.edu/?p=340</guid>
		<description><![CDATA[Lidan Wang, University of Maryland “Learning to Efficiently Rank” Technological advances have led to increases in the types and amounts of data, and there is a great need for developing methods to manage and find relevant information from such data to satisfy user&#8217;s information needs. Learning to rank is an emerging discipline at the intersection [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=hltcoe.jhu.edu&amp;blog=26761117&amp;post=340&amp;subd=hltcoe&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Lidan Wang, University of Maryland</p>
<p>“Learning to Efficiently Rank”</p>
<p>Technological advances have led to increases in the types and amounts of data, and there is a great need for developing methods to manage and find relevant information from such data to satisfy user&#8217;s information needs. Learning to rank is an emerging discipline at the intersection of machine learning, data mining, and information retrieval. It develops principled machine learning algorithms to construct ranking (i.e., retrieval) models, for finding and ranking relevant information to user queries over large amounts of data. Although learning to rank approaches are capable of learning highly effective ranking functions, they have mostly ignored the important issue of model efficiency (i.e., model speed). Given that efficiency and effectiveness are competing forces that often counteract each other, models that are optimized for effectiveness alone may not meet the strict efficiency requirements when dealing with real-world large-scale data-sets.</p>
<p>My PhD thesis introduces the Learning to Efficiently Rank framework for learning large-scale ranking models that facilitate fast and effective retrieval, by exploiting and optimizing the tradeoffs between model complexity (i.e., speed) and accuracy.  At a basic level, this framework learns ranking models whose speed and accuracy can be explicitly controlled.  I proposed and designed solutions for three problems within this framework: 1) learning large-scale ranking models according to a desired tradeoff between model speed and accuracy; 2) constructing temporally-constrained models capable of returning results under time budgets; 3) breaking through the speed/accuracy tradeoff barrier by developing a novel cascade ranking model, and learning the cascade model structure and parameters with a novel boosting-based learning algorithm. My research extends the conventional effectiveness-centric approach in model learning and takes an efficiency-minded look at building effective retrieval models.  Results show that models learned this way significantly outperform traditional machine-learned models in terms of speed without sacrificing result effectiveness. Moreover, the new models work particularly well when users impose stringent time requirements for ranked retrieval on very large data-sets.</p>
<p>Bio:</p>
<p>Lidan Wang is a PhD candidate in the Computer Science Department at the University of Maryland, College Park. She received her Master&#8217;s degree from the Department of Computer Science at the University of Wisconsin, Madison, and Bachelor&#8217;s degree from the Department of Computer Science at the University of Florida. Lidan&#8217;s research interests lie at the intersection of machine learning, information retrieval, and text and data mining. Lidan&#8217;s work focuses on designing large-scale machine learning and information retrieval techniques for learning, mining, and retrieving information at scale. Her PhD dissertation research led to a recent NSF research grant (IIS-1144034), which she co-authored.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/hltcoe.wordpress.com/340/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/hltcoe.wordpress.com/340/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/hltcoe.wordpress.com/340/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/hltcoe.wordpress.com/340/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/hltcoe.wordpress.com/340/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/hltcoe.wordpress.com/340/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/hltcoe.wordpress.com/340/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/hltcoe.wordpress.com/340/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/hltcoe.wordpress.com/340/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/hltcoe.wordpress.com/340/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/hltcoe.wordpress.com/340/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/hltcoe.wordpress.com/340/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/hltcoe.wordpress.com/340/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/hltcoe.wordpress.com/340/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=hltcoe.jhu.edu&amp;blog=26761117&amp;post=340&amp;subd=hltcoe&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://hltcoe.jhu.edu/2012/02/09/feb-20-2012-learning-to-efficiently-rank-lidan-wang/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/308742bc09c1585d68805c9dd233753e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">kdaught2</media:title>
		</media:content>
	</item>
		<item>
		<title>Feb. 13, 2012: Burr Settles: Interactive Machine Learning: Combining Learning Strategies with Humans in the Loop</title>
		<link>http://hltcoe.jhu.edu/2012/02/07/feb-13-2012-burr-settles-interactive-machine-learning-combining-learning-strategies-with-humans-in-the-loop/</link>
		<comments>http://hltcoe.jhu.edu/2012/02/07/feb-13-2012-burr-settles-interactive-machine-learning-combining-learning-strategies-with-humans-in-the-loop/#comments</comments>
		<pubDate>Tue, 07 Feb 2012 18:46:19 +0000</pubDate>
		<dc:creator>kdaught2</dc:creator>
				<category><![CDATA[Talks]]></category>

		<guid isPermaLink="false">http://hltcoe.jhu.edu/?p=335</guid>
		<description><![CDATA[Burr Settles, Carnegie Mellon University Interactive Machine Learning: Combining Learning Strategies with Humans in the Loop People learn by interacting with their teachers.  Why not machines?  What would it take to develop software that can learn how to solve problems by interacting and collaborating with humans?  This talk will describe my efforts to develop such [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=hltcoe.jhu.edu&amp;blog=26761117&amp;post=335&amp;subd=hltcoe&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p style="text-align:left;" align="center">Burr Settles, Carnegie Mellon University</p>
<p style="text-align:left;">Interactive Machine Learning: Combining Learning Strategies with Humans in the Loop</p>
<p style="text-align:left;">People learn by interacting with their teachers.  Why not machines?  What would it take to develop software that can learn how to solve problems by interacting and collaborating with humans?  This talk will describe my efforts to develop such systems, with the goal of training effective machine learners more quickly and economically.  In particular, I focus on two projects in natural language processing that combine multiple learning strategies: incorporating domain knowledge (taking advice in the form of human-provided rules), active learning (asking &#8220;questions&#8221; of human annotators), and semi-supervised learning (attempting to &#8220;teach itself&#8221; by extrapolating what has been learned onto abundant, unlabeled data).  Empirical results from user experiments show that these approaches are superior to their state-of-the-art &#8220;passive&#8221; learning counterparts.  Interestingly, these experiments provide initial insights into human &#8220;teaching&#8221; behavior as well, suggesting ways in which human factors can and should be taken into account.  I will also briefly discuss opportunities for interactive learning in other areas, such as supporting online communities, creative work, and biological discovery.</p>
<p>Bio:</p>
<p>Burr Settles is a Postdoctoral Fellow in the Machine Learning Department at Carnegie Mellon University. He received a PhD in Computer Sciences from the University of Wisconsin-Madison in 2008, with additional studies in Linguistics and Biology. His current research focuses on interactive machine learning that resembles a &#8220;dialogue&#8221; of decision-making and knowledge acquisition between computers and humans, with applications in natural language processing, biology, and social computing. He recently organized workshops at the ICML and NAACL conferences on these topics, and is the author of a popular literature survey on active learning (active-learning.net). He also runs the website FAWM.ORG, prefers sandals to shoes, and plays guitar in the Pittsburgh pop band Delicious Pastries.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/hltcoe.wordpress.com/335/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/hltcoe.wordpress.com/335/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/hltcoe.wordpress.com/335/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/hltcoe.wordpress.com/335/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/hltcoe.wordpress.com/335/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/hltcoe.wordpress.com/335/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/hltcoe.wordpress.com/335/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/hltcoe.wordpress.com/335/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/hltcoe.wordpress.com/335/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/hltcoe.wordpress.com/335/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/hltcoe.wordpress.com/335/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/hltcoe.wordpress.com/335/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/hltcoe.wordpress.com/335/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/hltcoe.wordpress.com/335/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=hltcoe.jhu.edu&amp;blog=26761117&amp;post=335&amp;subd=hltcoe&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://hltcoe.jhu.edu/2012/02/07/feb-13-2012-burr-settles-interactive-machine-learning-combining-learning-strategies-with-humans-in-the-loop/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/308742bc09c1585d68805c9dd233753e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">kdaught2</media:title>
		</media:content>
	</item>
		<item>
		<title>Feb. 6, 2012: Chris Dyer: Generate-and-Test Models for Alignment and Machine Translation</title>
		<link>http://hltcoe.jhu.edu/2012/01/27/feb-6-2012-chris-dyer-generate-and-test-models-for-alignment-and-machine-translation/</link>
		<comments>http://hltcoe.jhu.edu/2012/01/27/feb-6-2012-chris-dyer-generate-and-test-models-for-alignment-and-machine-translation/#comments</comments>
		<pubDate>Fri, 27 Jan 2012 06:14:29 +0000</pubDate>
		<dc:creator>mdredze</dc:creator>
				<category><![CDATA[Talks]]></category>

		<guid isPermaLink="false">http://hltcoe.jhu.edu/?p=331</guid>
		<description><![CDATA[Chris Dyer, Carnegie Mellon University Generate-and-Test Models for Alignment and Machine Translation I discuss translation as an optimization problem subject to three kinds of constraints: lexical, configurational, and constraints enforcing target-language wellformedness. Lexical constraints ensure that the lexical choices in the output are meaning-preserving; configurational constraints ensure that the relationships between source words and phrases [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=hltcoe.jhu.edu&amp;blog=26761117&amp;post=331&amp;subd=hltcoe&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Chris Dyer, Carnegie Mellon University</p>
<p>Generate-and-Test Models for Alignment and Machine Translation</p>
<p>I discuss translation as an optimization problem subject to three kinds of constraints: lexical, configurational, and constraints enforcing target-language wellformedness. Lexical constraints ensure that the lexical choices in the output are meaning-preserving; configurational constraints ensure that the relationships between source words and phrases (e.g., semantic roles and modifier-headrelationships) are properly transformed in translation; and target-language wellformedness constraints ensure the grammaticality of the output. In terms of the traditional source-channel model of Brown et al. (1993), the &#8220;translation model&#8221; encodes lexical and configurational constraints and the &#8220;language model&#8221; encodes target language wellformedness constraints. On the other hand, the constraint-based framework suggests a generate-and-test (discriminative) model of translation in which features sensitive to input and output structures, and the feature weights are trained to maximize the (conditional) likelihood of a corpus of example translations. The specified features represent empirical hypotheses about what variables correlate (but not why) and thus encode domain-specific knowledge that is useful for the problem at hand; the learned weights indicate to what extent these hypotheses are confirmed or refuted.</p>
<p>To verify the usefulness of the feature-based approach, I discuss the performance two models: first, a lexical translation model evaluated by the word alignments it learns. Unlike previous unsupervised alignment models, the new model utilizes features that capture diverse lexical and alignment relationships, including morphological relatedness, orthographic similarity, and conventional co-occurrence statistics. Results from typologically diverse language pairs demonstrate that the generate-and-test model provides substantial performance benefits compared to state-of-the-art generative baselines.  Second, I discuss the results of an end-to-end translation model in which lexical, configurational, and wellformedness constraints are modeled independently. Because of the independence assumptions, the model is substantially more compact than state-of-the-art translation models, but still performs significantly better on languages where source-target word order differences are substantial.</p>
<p>Bio:</p>
<p>Chris Dyer is a postdoctoral researcher in Noah Smith&#8217;s lab in the Language Technologies Institute at Carnegie Mellon University. He completed his PhD on statistical machine translation with Philip Resnik at the University of Maryland in 2010. Together with Jimmy Lin, he is author of &#8220;Data-Intensive Text Processing with MapReduce&#8221;, published by Morgan &amp; Claypool in 2010. Current research interests include machine translation, unsupervised learning, Bayesian techniques, and &#8220;big data&#8221; problems in NLP.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/hltcoe.wordpress.com/331/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/hltcoe.wordpress.com/331/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/hltcoe.wordpress.com/331/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/hltcoe.wordpress.com/331/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/hltcoe.wordpress.com/331/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/hltcoe.wordpress.com/331/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/hltcoe.wordpress.com/331/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/hltcoe.wordpress.com/331/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/hltcoe.wordpress.com/331/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/hltcoe.wordpress.com/331/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/hltcoe.wordpress.com/331/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/hltcoe.wordpress.com/331/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/hltcoe.wordpress.com/331/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/hltcoe.wordpress.com/331/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=hltcoe.jhu.edu&amp;blog=26761117&amp;post=331&amp;subd=hltcoe&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://hltcoe.jhu.edu/2012/01/27/feb-6-2012-chris-dyer-generate-and-test-models-for-alignment-and-machine-translation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/8f8c95a342acedf2d2875563d79e58e7?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mdredze</media:title>
		</media:content>
	</item>
		<item>
		<title>Oct. 18, 2011 &#8211; COE Technical Exchange</title>
		<link>http://hltcoe.jhu.edu/2011/10/04/coe-technical-exchange-oct-18-2011/</link>
		<comments>http://hltcoe.jhu.edu/2011/10/04/coe-technical-exchange-oct-18-2011/#comments</comments>
		<pubDate>Tue, 04 Oct 2011 16:06:16 +0000</pubDate>
		<dc:creator>hltcoe</dc:creator>
				<category><![CDATA[Technical Meetings]]></category>

		<guid isPermaLink="false">http://hltcoe.wordpress.com/?p=272</guid>
		<description><![CDATA[<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=hltcoe.jhu.edu&amp;blog=26761117&amp;post=272&amp;subd=hltcoe&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=hltcoe.jhu.edu&amp;blog=26761117&amp;post=272&amp;subd=hltcoe&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://hltcoe.jhu.edu/2011/10/04/coe-technical-exchange-oct-18-2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a5eaba11a800c99f8658dfa83d77dfce?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">hltcoe</media:title>
		</media:content>
	</item>
		<item>
		<title>June 4 &#8211; August 10, 2012 &#8211; Summer Camp for Advanced Language Exploration</title>
		<link>http://hltcoe.jhu.edu/2011/10/04/scale-2012-june-4-august-10-2012/</link>
		<comments>http://hltcoe.jhu.edu/2011/10/04/scale-2012-june-4-august-10-2012/#comments</comments>
		<pubDate>Tue, 04 Oct 2011 16:05:24 +0000</pubDate>
		<dc:creator>hltcoe</dc:creator>
				<category><![CDATA[Workshops]]></category>

		<guid isPermaLink="false">http://hltcoe.wordpress.com/?p=270</guid>
		<description><![CDATA[<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=hltcoe.jhu.edu&amp;blog=26761117&amp;post=270&amp;subd=hltcoe&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=hltcoe.jhu.edu&amp;blog=26761117&amp;post=270&amp;subd=hltcoe&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://hltcoe.jhu.edu/2011/10/04/scale-2012-june-4-august-10-2012/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a5eaba11a800c99f8658dfa83d77dfce?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">hltcoe</media:title>
		</media:content>
	</item>
		<item>
		<title>Oct. 4, 2011 &#8211; Author Attritube Detection Workshop</title>
		<link>http://hltcoe.jhu.edu/2011/10/04/author-attritube-detection-workshop-oct-4-2011/</link>
		<comments>http://hltcoe.jhu.edu/2011/10/04/author-attritube-detection-workshop-oct-4-2011/#comments</comments>
		<pubDate>Tue, 04 Oct 2011 15:55:44 +0000</pubDate>
		<dc:creator>hltcoe</dc:creator>
				<category><![CDATA[Technical Meetings]]></category>

		<guid isPermaLink="false">http://hltcoe.wordpress.com/?p=268</guid>
		<description><![CDATA[<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=hltcoe.jhu.edu&amp;blog=26761117&amp;post=268&amp;subd=hltcoe&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=hltcoe.jhu.edu&amp;blog=26761117&amp;post=268&amp;subd=hltcoe&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://hltcoe.jhu.edu/2011/10/04/author-attritube-detection-workshop-oct-4-2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a5eaba11a800c99f8658dfa83d77dfce?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">hltcoe</media:title>
		</media:content>
	</item>
		<item>
		<title>Sept. 2, 2011:  Dan Povey: Applications of Weighted Finite State Transducers in a Speech Recognition Toolkit</title>
		<link>http://hltcoe.jhu.edu/2011/09/02/sept-2-2011-dan-povey-applications-of-weighted-finite-state-transducers-in-a-speech-recognition-toolkit/</link>
		<comments>http://hltcoe.jhu.edu/2011/09/02/sept-2-2011-dan-povey-applications-of-weighted-finite-state-transducers-in-a-speech-recognition-toolkit/#comments</comments>
		<pubDate>Fri, 02 Sep 2011 12:31:47 +0000</pubDate>
		<dc:creator>hltcoe</dc:creator>
				<category><![CDATA[Talks]]></category>

		<guid isPermaLink="false">http://hltcoe.wordpress.com/?p=194</guid>
		<description><![CDATA[Dan Povey, Microsoft Research Applications of weighted finite state transducers in a speech recognition toolkit The open-source speech recognition toolkit &#8220;Kaldi&#8221; uses weighted finite state  transducer (WFSTs) for training and decoding, and uses the OpenFst toolkit as a  C++ library.  I will give an informal overview of WFSTs and of the standard AT&#38;T  recipe for [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=hltcoe.jhu.edu&amp;blog=26761117&amp;post=194&amp;subd=hltcoe&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Dan Povey, Microsoft Research</p>
<p>Applications of weighted finite state transducers in a speech recognition toolkit</p>
<p>The open-source speech recognition toolkit &#8220;Kaldi&#8221; uses weighted finite state  transducer (WFSTs) for training and decoding, and uses the OpenFst toolkit as a  C++ library.  I will give an informal overview of WFSTs and of the standard AT&amp;T  recipe for WFST based decoding, and will mention some problems (in my opinion)  with the basic recipe and how we addressed them while developing Kaldi.  I will  also describe how to use WFSTs to acheive &#8220;exact&#8221; lattice generation, in a sense  will be explained.  This is an interesting application of WFSTs because, unlike  most WFST mechanisms, it does not have any obvious non-WFST analog.</p>
<p>Bio:   Daniel Povey received his Bachelor&#8217;s (Natural Sciences, 1997), Master&#8217;s  (Computer Speech and Language Processing, 1998) and PhD (Engineering, 2003) from  Cambridge University.  He is currently a researcher at Microsoft Research,  Redmond, Washington, USA.  From 2003 to 2008 he worked as a researcher in IBM  Research in Yorktown Heights, NY.  He is best known for his work on  discriminative training for HMM-GMM based speech recognition (i.e. MMI, MPE, and  their feature-space variants).</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/hltcoe.wordpress.com/194/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/hltcoe.wordpress.com/194/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/hltcoe.wordpress.com/194/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/hltcoe.wordpress.com/194/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/hltcoe.wordpress.com/194/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/hltcoe.wordpress.com/194/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/hltcoe.wordpress.com/194/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/hltcoe.wordpress.com/194/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/hltcoe.wordpress.com/194/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/hltcoe.wordpress.com/194/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/hltcoe.wordpress.com/194/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/hltcoe.wordpress.com/194/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/hltcoe.wordpress.com/194/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/hltcoe.wordpress.com/194/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=hltcoe.jhu.edu&amp;blog=26761117&amp;post=194&amp;subd=hltcoe&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://hltcoe.jhu.edu/2011/09/02/sept-2-2011-dan-povey-applications-of-weighted-finite-state-transducers-in-a-speech-recognition-toolkit/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a5eaba11a800c99f8658dfa83d77dfce?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">hltcoe</media:title>
		</media:content>
	</item>
	</channel>
</rss>
