Contact
nasmith@cs.washington.edu
Areas of interest: Natural language processing
Open Extraction of Fine-Grained Political Opinion
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2015.
Improved Transition-based Parsing by Modeling Characters instead of Words with LSTMs
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2015.
A Utility Model of Authors in the Scientific Community
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2015.
Extractive Summarization by Maximizing Semantic Volume
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2015.
Bayesian Optimization of Text Representations
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2015.
Learning Word Representations with Hierarchical Sparse Coding
Proceedings of the International Conference on Machine Learning, 2015.
Sparse Binary Word Vector Representations
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2015.
Transition-Based Dependency Parsing with Stack Long Short-Term Memory
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2015.
Frame-Semantic Role Labeling with Heterogeneous Annotations
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2015.
The Media Frames Corpus: Annotations of Frames Across Issues
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2015.
A Supertag-Context Model for Weakly-Supervised CCG Parser Learning
Proceedings of the Conference on Computational Natural Language Learning, 2015.
Retrofitting Word Vectors to Semantic Lexicons
Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics, 2015.
Best student paper award.
A Corpus and Model Integrating Multiword Expressions and Supersenses
Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics, 2015.
Toward Abstractive Summarization Using Semantic Representations
Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics, 2015.
Transforming Dependencies into Phrase Structures
Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics, 2015.
Contextualized Sarcasm Detection on Twitter
Proceedings of the International AAAI Conference on Weblogs and Social Media, 2015.
Modeling User Arguments, Interactions, and Attributes for Stance Prediction in Online Debate Forums
Proceedings of the SIAM Conference on Data Mining, 2015.
AD³: Alternating Directions Dual Decomposition for MAP Inference in Graphical Models
Journal of Machine Learning Research 16, 2015.
Weakly-Supervised Grammar-Informed Bayesian CCG Parser Learning
Proceedings of the AAAI Conference on Artificial Intelligence, 2015.
The Utility of Text: The Case of Amicus Briefs and the Supreme Court
Proceedings of the AAAI Conference on Artificial Intelligence, 2015.
Conditional Random Field Autoencoders for Unsupervised Structured Prediction
Advances in Neural Information Processing Systems 27, 2014.
Selected for oral presentation (top 5% of accepted papers).
Unsupervised Discovery of Biographical Structure from Text
Transactions of the Association for Computational Linguistics 2:2014, 2014.
A Dependency Parser for Tweets
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2014.
A Step Towards Usable Privacy Policy: Automatic Alignment of Privacy Statements
Proceedings of the International Conference on Computational Linguistics, 2014.
CMU: Arc-Factored, Discriminative Semantic Dependency Parsing
Proceedings of the International (COLING) Workshop on Semantic Evaluations, 2014.
Weakly-Supervised Bayesian Learning of a CCG Supertagger
Proceedings of the Conference on Computational Natural Language Learning, 2014.
Unsupervised Alignment of Privacy Policies using Hidden Markov Models
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2014.
Overview of the 2014 NLP Unshared Task in PoliInformatics
Proceedings of the ACL 2014 Workshop on Language Technologies and Computational Social Science, 2014.
A Discriminative Graph-Based Parser for the Abstract Meaning Representation
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2014.
Nominated for best paper award.
Distributed Representations of Geographically Situated Language
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2014.
A Bayesian Mixed Effects Model of Literary Character
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2014.
Simplified Dependency Annotations with GFL-Web
Proceedings of the Annual Meeting of the Association for Computational Linguistics, companion volume, 2014.
Linguistic Structured Sparsity in Text Categorization
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2014.
Making the Most of Bag of Words: Sentence Regularization with Alternating Direction Method of Multipliers
Proceedings of the International Conference on Machine Learning, 2014.
Phrase Dependency Machine Translation with Quasi-Synchronous Tree-to-Tree Features
Computational Linguistics 40:2, 2014.
Comprehensive Annotation of Multiword Expressions in a Social Web Corpus
Proceedings of the Language Resources and Evaluation Conference, 2014.
Dynamic Models of Streaming Text
Transactions of the Association for Computational Linguistics 2, 2014.
Discriminative Lexical Semantic Segmentation with Gaps: Running the MWE Gamut
Transactions of the Association for Computational Linguistics 2, 2014.
Frame-Semantic Parsing
Computational Linguistics 40:1, 2014.
Translating into Morphologically Rich Languages with Synthetic Phrases
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2013.
Learning Topics and Positions from Debatepedia
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2013.
Measuring Ideological Proportions in Political Speeches
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2013.
Predicting the NFL Using Twitter
Proceedings of the ECML/PKDD Workshop on (Machine Learning and Data Mining for) Sports Analytics, 2013.
Learning Latent Personas of Film Characters
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2013.
Learning to Extract International Relations from Political Context
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2013.
Testing the Etch-a-Sketch Hypothesis: A Computational Analysis of Mitt Romney's Ideological Makeover During the 2012 Primary vs. General Elections
Presented at the Annual Meeting of the American Political Science Association
A Framework for (Under)specifying Dependency Syntax without Overloading Annotators
Proceedings of the ACL Linguistic Annotation Workshop, 2013.
Turning on the Turbo: Fast Third-Order Non-Projective Turbo Parsers
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2013.
A Penny for your Tweets: Campaign Contributions and Capitol Hill Microblogs
Proceedings of the International AAAI Conference on Weblogs and Social Media, 2013.
Knowledge-Rich Morphological Priors for Bayesian Language Models
Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics, 2013.
Nominated for best paper award.
A Simple, Fast, and Effective Reparameterization of IBM Model 2
Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics, 2013.
Improved Part-of-Speech Tagging for Online Conversational Text with Word Clusters
Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics, 2013.
Supersense Tagging for Arabic: the MT-in-the-Middle Attack
Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics, 2013.
Linguistic Structure Prediction with the Sparseptron
ACM Crossroads 19:3, 2013.
Mapping the Geographical Diffusion of New Words
Proceedings of the NIPS Workshop on Social Network and Social Media Analysis: Methods, Models and Applications, 2012.
Automatic Categorization of Privacy Policies: A Pilot Study
Carnegie Mellon University:CMU-LTI-12-019, 2012.
pycdec: A Python Interface to cdec
Prague Bulletin of Mathematical Linguistics 98, 2012.
Empirical Risk Minimization for Probabilistic Grammars: Sample Complexity and Hardness of Learning
Computational Linguistics 38:3, 2012.
Transliteration by Sequence Labeling with Lattice Encodings and Reranking
Proceedings of the ACL Named Entities Workshop, 2012.
Word Salad: Relating Food Prices and Descriptions
Proceedings of the Conference on Empirical Methods in Natural Language Processing and Natural Language Learning, 2012.
Discovering Factions in the Computational Linguistics Community
Proceedings of the ACL Workshop on Rediscovering Fifty Years of Discoveries, 2012.
Coarse Lexical Semantic Annotation with Supersenses: An Arabic Case Study
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2012.
A Probabilistic Model for Canonicalizing Named Entity Mentions
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2012.
Concavity and Initialization for Unsupervised Dependency Parsing
Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics, 2012.
Textual Predictors of Bill Survival in Congressional Committees
Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics, 2012.
Structured Ramp Loss Minimization for Machine Translation
Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics, 2012.
An Exact Dual Decomposition Algorithm for Shallow Semantic Parsing with Constraints
Proceedings of the Joint Conference on Lexical and Computational Semantics, 2012.
Graph-Based Lexicon Expansion with Sparsity-Inducing Penalties
Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics, 2012.
Recall-Oriented Learning of Named Entities in Arabic Wikipedia
Proceedings of the Conference of the European Chapter of the Association for Computational Linguistics, 2012.
Censorship and Content Deletion in Chinese Social Media
First Monday 17:3, 2012.
Computational Text Analysis for Social Science: Model Complexity and Assumptions
Proceedings of the NIPS Workshop on Computational Social Science and the Wisdom of Crowds, 2011.
Quasi-Synchronous Phrase Dependency Grammars for Machine Translation
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2011.
Structured Databases of Named Entities from Bayesian Nonparametrics
Proceedings of the EMNLP Workshop on Unsupervised Learning in NLP, 2011.
Predicting a Scientific Community's Response to an Article
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2011.
Unsupervised Bilingual POS Tagging with Markov Random Fields
Proceedings of the EMNLP Workshop on Unsupervised Learning in NLP, 2011.
Unsupervised Structure Prediction with Non-Parallel Multilingual Guidance
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2011.
The CMU-ARK German-English Translation System
Proceedings of the EMNLP Workshop on Statistical Machine Translation, 2011.
Structured Sparsity in Structured Prediction
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2011.
Generative Models of Monolingual and Bilingual Gappy Patterns
Proceedings of the EMNLP Workshop on Statistical Machine Translation, 2011.
Dual Decomposition with Many Overlapping Components
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2011.
Author Age Prediction from Text using Linear Regression
Proceedings of the ACL Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, 2011.
An Augmented Lagrangian Approach to Constrained MAP Inference
Proceedings of the International Conference on Machine Learning, 2011.
Better Hypothesis Testing for Statistical Machine Translation: Controlling for Optimizer Instability
Proceedings of the Annual Meeting of the Association for Computational Linguistics, companion volume, 2011.
Discovering Sociolinguistic Associations with Structured Sparsity
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2011.
Unsupervised Word Alignment with Arbitrary Features
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2011.
Semi-Supervised Frame-Semantic Parsing for Unknown Predicates
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2011.
Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments
Proceedings of the Annual Meeting of the Association for Computational Linguistics, companion volume, 2011.
Linguistic Structure Prediction
Synthesis Lectures on Human Language Technologies, Morgan and Claypool, 2011.
Online Learning of Structured Predictors with Multiple Kernels
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2011.
Favor Short Dependencies: Parsing with Soft and Hard Constraints on Dependency Length
Trends in Parsing Technology: Dependency Parsing, Domain Adaptation, and Deep Parsing, Springer 43, 2011.
Products of Weighted Logic Programs
Theory and Practice of Logic Programming 11:2–3, 2011.
Empirical Risk Minimization with Approximations of Probabilistic Grammars
Advances in Neural Information Processing Systems 23, 2010.
Online Multiple Kernel Learning for Structured Prediction
Proceedings of the NIPS Workshop on New Directions in Multiple Kernel Learning, 2010.
Discovering Demographic Language Variation
Proceedings of the NIPS Workshop on Machine Learning for Social Computing, 2010.
Augmenting Dual Decomposition for MAP Inference
Proceedings of the International Workshop on Optimization for Machine Learning, 2010.
Covariance in Unsupervised Learning of Probabilistic Grammars
Journal of Machine Learning Research 11, 2010.
A Latent Variable Model for Geographic Lexical Variation
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2010.
Turbo Parsers: Dependency Parsing by Approximate Variational Inference
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2010.
Nonparametric Word Segmentation for Machine Translation
Proceedings of the International Conference on Computational Linguistics, 2010.
Best paper finalist.
SEMAFOR: Frame Argument Resolution with Log-Linear Models
Proceedings of the International (ACL) Workshop on Semantic Evaluations, 2010.
Visualizing Topical Quotations Over Time to Understand News Discourse
Carnegie Mellon University:CMU-LTI-10-013, 2010.
Viterbi Training for PCFGs: Hardness Results and Competitiveness of Uniform Initialization
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2010.
Tree Edit Models for Recognizing Textual Entailments, Paraphrases, and Answers to Questions
Proceedings of the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference, 2010.
Variational Inference for Adaptor Grammars
Proceedings of the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference, 2010.
Softmax-Margin Training for Structured Log-Linear Models
Carnegie Mellon University:CMU-LTI-10-008, 2010.
Movie Reviews and Revenues: An Experiment in Text Regression
Proceedings of the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference, 2010.
Good Question! Statistical Ranking for Question Generation
Proceedings of the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference, 2010.
Extracting Simplified Statements for Factual Question Generation
Proceedings of the AIED Workshop on Question Generation, 2010.
Softmax-Margin CRFs: Training Log-Linear Models with Cost Functions
Proceedings of the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference, 2010.
Probabilistic Frame-Semantic Parsing
Proceedings of the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference, 2010.
Shedding (a Thousand Points of) Light on Biased Language
Proceedings of the NAACL-HLT Workshop on Creating Speech and Language Data With Mechanical Turk, 2010.
Rating Computer-Generated Questions with Mechanical Turk
Proceedings of the NAACL-HLT Workshop on Creating Speech and Language Data With Mechanical Turk, 2010.
Aggressive Online Learning of Structured Classifiers
Carnegie Mellon University:CMU-ML-10-109, 2010.
From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series
Proceedings of the International AAAI Conference on Weblogs and Social Media, 2010.
What's Worthy of Comment? Content and Comment Volume in Political Blogs
Proceedings of the International AAAI Conference on Weblogs and Social Media, 2010.
Text-Driven Forecasting
2010.
Concise Integer Linear Programming Formulations for Dependency Parsing
Proceedings of the Joint Conference of the Annual Meeting of the Association for Computational Linguistics and the International Joint Conference on Natural Language Processing, 2009.
Best paper award.
Variational Inference for Grammar Induction with Prior Knowledge
Proceedings of the Joint Conference of the Annual Meeting of the Association for Computational Linguistics and the International Joint Conference on Natural Language Processing, companion volume, 2009.
Feature-Rich Translation by Quasi-Synchronous Lattice Parsing
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2009.
Leveraging Structural Relations for Fluent Compressions at Multiple Compression Rates
Proceedings of the Joint Conference of the Annual Meeting of the Association for Computational Linguistics and the International Joint Conference on Natural Language Processing, companion volume, 2009.
Paraphrase Identification as Probabilistic Quasi-Synchronous Recognition
Proceedings of the Joint Conference of the Annual Meeting of the Association for Computational Linguistics and the International Joint Conference on Natural Language Processing, 2009.
Ranking Automatically Generated Questions as a Shared Task
Proceedings of the AIED Workshop on Question Generation, 2009.
Summarization with a Joint Model for Sentence Extraction and Compression
Proceedings of the NAACL-HLT Workshop on Integer Linear Programming for Natural Language Processing, 2009.
Polyhedral Outer Approximations with Application to Natural Language Parsing
Proceedings of the International Conference on Machine Learning, 2009.
Question Generation via Overgenerating Transformations and Ranking
Carnegie Mellon University:CMU-LTI-09-013, 2009.
Predicting Response to Political Blog Posts with Topic Models
Proceedings of the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference, 2009.
Preference Grammars: Softening Syntactic Constraints to Improve Statistical Machine Translation
Proceedings of the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference, 2009.
Predicting Risk from Financial Reports with Regression
Proceedings of the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference, 2009.
Shared Logistic Normal Distributions for Soft Parameter Tying in Unsupervised Grammar Induction
Proceedings of the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference, 2009.
From Episodes to Sagas: Understanding the News by Identifying Temporally Related Story Sequences
Proceedings of the International AAAI Conference on Weblogs and Social Media, 2009.
Nonextensive Information Theoretic Kernels on Measures
Journal of Machine Learning Research 10, 2009.
Cube Summing, Approximate Inference with Non-Local Features, and Dynamic Programming without Semirings
Proceedings of the Conference of the European Chapter of the Association for Computational Linguistics, 2009.
Logistic Normal Priors for Unsupervised Probabilistic Grammar Induction
Advances in Neural Information Processing Systems 21, 2008.
The Shared Logistic Normal Distribution for Grammar Induction
Proceedings of the NIPS Workshop on Speech and Language: Unsupervised Latent-Variable Models, 2008.
Dynamic Programming Algorithms as Products of Weighted Logic Programs
Proceedings of the International Conference on Logic Programming, 2008.
Best student paper award.
Stacking Dependency Parsers
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2008.
Wider Pipelines: N-Best Alignments and Parses in MT Training
Proceedings of the Conference of the Association for Machine Translation in the Americas, 2008.
Question Generation as a Competitive Undergraduate Course Project
Proceedings of the NSF Workshop on the Question Generation Shared Task and Evaluation Challenge, 2008.
Review of Computational Approaches to Morphology and Syntax by Brian Roark and Richard Sproat
Computational Linguistics 34:3, 2008.
Nonextensive Entropic Kernels
Proceedings of the International Conference on Machine Learning, 2008.
Competitive Grammar Writing
Proceedings of the ACL Workshop on Issues in Teaching Computational Linguistics, 2008.
Rich Source-Side Context for Statistical Machine Translation
Proceedings of the ACL Workshop on Statistical Machine Translation, 2008.
Five-year retrospective best paper award.
SOUR CREAM: Toward Semantic Processing of Recipes
Carnegie Mellon University:CMU-LTI-08-005, 2008.
Relative Keyboard Input System
Proceedings of the International Conference on Intelligent User Interfaces, 2008.
Weighted and Probabilistic Context-Free Grammars Are Equally Expressive
Computational Linguistics 33:4, 2007.
What is the Jeopardy Model? A Quasi-Synchronous Grammar for QA
Proceedings of the Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2007.
Nominated for best paper award.
Joint Morphological and Syntactic Disambiguation
Proceedings of the Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2007.
Probabilistic Models of Nonprojective Dependency Trees
Proceedings of the Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2007.
Computationally Efficient M-Estimation of Log-Linear Structure Models
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2007.
Novel Estimation Methods for Unsupervised Discovery of Latent Structure in Natural Language Text
Department of Computer Science, Johns Hopkins University, 2006.
Supervised by Jason Eisner.
Annealing Structural Bias in Multilingual Weighted Grammar Induction
Proceedings of the International Conference on Computational Linguistics and Annual Meeting of the Association for Computational Linguistics, 2006.
Vine Parsing and Minimum Risk Reranking for Speed and Precision
Proceedings of the Conference on Natural Language Learning, 2006.
Parsing with Soft and Hard Constraints on Dependency Length
Proceedings of the International Workshop on Parsing Technologies, 2005.
Context-Based Morphological Disambiguation with Random Fields
Proceedings of the Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, 2005.
Compiling Comp Ling: Practical Weighted Dynamic Programming and the Dyna Language
Proceedings of the Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, 2005.
Guiding Unsupervised Grammar Induction Using Contrastive Estimation
Proceedings of the IJCAI Workshop on Grammatical Inference Applications, 2005.
Contrastive Estimation: Training Log-Linear Models on Unlabeled Data
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2005.
Nominated for best paper award.
Dyna: A Declarative Language for Implementing Dynamic Programs
Proceedings of the Annual Meeting of the Association for Computational Linguistics, companion volume, 2004.
Bilingual Parsing with Factored Estimation: Using English to Parse Korean
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2004.
Annealing Techniques for Unsupervised Statistical Language Learning
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2004.
From Words to Corpora: Recognizing Translation
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2002.
Ellipsis Happens, and Deletion is How
University of Maryland Working Papers in Linguistics, Department of Linguistics, University of Maryland 11, 2001.
Undergraduate honors thesis, supervised by Norbert Hornstein.
Detection of Translational Equivalence
Department of Computer Science, University of Maryland College Park:4253, 2001.
Undergraduate honors thesis, supervised by Philip Resnik.
Cairo: An Alignment Visualization Tool
Proceedings of the Language Resources and Evaluation Conference, 2000.
Statistical Machine Translation
Johns Hopkins University:42, 1999.