Publications

Noisy BiLSTM-Based Models for Disfluency Detection, Nguyen Bach & Fei Huang, INTERSPEECH 2019

(2022). Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning. In NeurIPS-2022.

Preprint PDF

(2022). ITA: Image-Text Alignments for Multi-Modal Named Entity Recognition. In NAACL-2022.

PDF Code Poster Slides

(2022). The innovation behind AI at Scale. In Microsoft-2022.

PDF Project

(2021). MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations. In EMNLP-2021.

PDF Code

(2021). Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor. In ACL-2021.

PDF Code Project

(2021). Risk Minimization for Zero-shot Sequence Labeling. In ACL-2021.

PDF

(2021). Multi-View Cross-Lingual Structured Prediction with Minimum Supervision. In ACL-2021.

PDF

(2021). Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning. In ACL-2021.

PDF

(2021). Automated Concatenation of Embeddings for Structured Prediction. In ACL-2021.

PDF Code Project

(2020). More Embeddings, Better Sequence Labelers? In EMNLP Findings 2020.

PDF

(2020). An Investigation of Potential Function Designs for Neural CRF. In EMNLP Findings 2020.

PDF

(2020). AIN: Fast and Accurate Sequence Labeling with Approximate Inference Network. In EMNLP 2020.

PDF

(2020). Structure-Level Knowledge Distillation For Multilingual Sequence Labeling. In ACL 2020.

PDF Slides Video

(2020). Findings of the IWSLT 2020 Evaluation Campaign. In IWSLT 2020.

PDF

(2019). Noisy BiLSTM-Based Models for Disfluency Detection. In Interspeech 2019.

PDF Code Poster

(2019). Unsupervised Multi-Modal Neural Machine Translation. The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

PDF

(2018). Alibaba Speech Translation Systems for IWSLT 2018. Proceedings of the International Workshop on Spoken Language Translation.

PDF

(2016). Eye Tracking Using Video Information and Electrooculography Information. United States Patent & Trademark Office.

Patent

(2012). The SDL Language Weaver Systems in the WMT12 Quality Estimation Shared Task. Proceedings of the Seventh Workshop on Statistical Machine Translation.

PDF Slides

(2011). TriS: A Statistical Sentence Simplifier with Log-linear Models and Margin-based Discriminative Training. Proceedings of 5th International Joint Conference on Natural Language Processing.

PDF Slides

(2011). CMU Haitian Creole-English Translation System for WMT 2011. Proceedings of the Sixth Workshop on Statistical Machine Translation.

PDF Slides

(2011). Goodness: A Method for Measuring Machine Translation Confidence. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies.

PDF Slides Visualization

(2010). A Semi-Supervised Word Alignment Algorithm with Partial Manual Alignments. Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR.

PDF Slides

(2009). Virtual Babel: Towards Context-Aware Machine Translation in Virtual Worlds. Proceedings of the Twelfth Machine Translation Summit (MTSummit-XII).

PDF Poster

(2009). Source-side Dependency Tree Reordering Models with Subtree Movements and Constraints. Proceedings of the Twelfth Machine Translation Summit (MTSummit-XII).

PDF Slides

(2009). Incremental Adaptation of Speech-to-Speech Translation. Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers.

PDF Poster

(2009). Cohesive Constraints in A Beam Search Phrase-based Decoder. Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers.

PDF Slides

(2008). Recent Improvements in the CMU Large Scale Chinese-English SMT System. Proceedings of ACL-08: HLT, Short Papers.

PDF Slides

(2008). Improving Word Alignment with Language Model Based Confidence Scores. Proceedings of the Third Workshop on Statistical Machine Translation.

PDF Slides

(2007). A Log-Linear Block Transliteration Model based on Bi-Stream HMMs. Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Proceedings of the Main Conference.

PDF Slides

(2007). A Survey on Relation Extraction. Language Technologies Institute, Carnegie Mellon University.

PDF Slides

(2007). The UKA-CMU Statistical Machine Translation Systems for IWSLT 2007. Proc. of the International Workshop on Spoken Language Translation.

PDF Slides

(2007). The CMU TransTac 2007 Eyes-free and Hands-free Two-way Speech-to-Speech Translation System. Proc. of the International Workshop on Spoken Language Translation.

PDF Slides

(2007). Handling OOV Words in Arabic ASR Via Flexible Morphological Constraints. Proceedings of Interspeech 2007.

PDF Slides

(2006). The UKA/CMU Statistical Machine Translation System for IWSLT 2006. Proc. of the International Workshop on Spoken Language Translation.

PDF Slides

(2003). Quantitative Analysis and Synthesis of Syllabic Tones in Vietnamese. Proceedings of the 8th European Conference on Speech Communication and Technology.

PDF