Biography

I am currently a principal engineer at Microsoft GenAI. My main focus is to bring GenAI to practically serve Microsoft and its custmers on realistic scenarios. In the last few years, I

help to drive the creation and release of the Phi model series including Phi-4-multimodal, Phi-4-mini (February, 2025), Phi-3.5, Phi-3 to both open source community and Azure AI.
drove AI strategies at scale and bring clarity to the leadership team at the Microsoft Office of the CTO, particularly on large model training roadmap and real-world product scenarios.

My professional career is a mix of industrial research labs and startups. I spend a few years at the Machine Intelligence Technology Lab, DAMO Academy, Alibaba. My main focus is to break the language barriers across the Alibaba ecosystem by researching and developing AI solutions for eCommerce scenarios. I was an early machine learning engineer at Textio, a start-up of augmented writing, where I was responsible for training and deploying prediction models. I worked for Microsoft on machine learning models in wearable devices such as the HoloLens project. I was a machine translations researcher at SDL. I am actively coaching and consultanting early-stage startups and young engineers in Vietnam.

Specialties: GenAI, AI strategies, LLM, multimodal, AI product deployments.

Interests

Large Language and Multimodal Modeling
Model Benchmarks
Agentic-based Applications
Data Synthetics and Curation

Education

PhD in Language Technology, 2012

Carnegie Mellon University
MS in Computer Science, 2005

Johns Hopkins University
BSc in Maths & CS, 2001

Vietnam National University, Hanoi

Experience

Principal engineer

Microsoft

Nov 2021 – Present Redmond, Washington

AI at Scale

Staff engineer

Alibaba

Jul 2018 – Nov 2021 Bellevue, Washington

Breaking language barriers in the Alibaba ecosystem

Software engineer

Textio

Oct 2016 – Jul 2018 Seattle, Washington

As the 1st machine learning engineer, I’ve helped build Textio’s core predictive engine and learning loop for the augmented writing platform which already used by thousands of companies worldwide.

Spearheaded the development of the Textio core models with cutting-edge technologies in statistical natural language processing and machine learning.
Design, develop, ship, and improve production features, such as prediction engines for equal opportunity employment, job type, and document type.
Created scoring models that helped increase predictive power significantly while preserving explainability and interpretability.

Research scientist

Microsoft

Jan 2014 – Oct 2016 Redmon, Washington

Working on the next generation of wearable devices at Microsoft, e.g. HoloLens:

BCI with deep learning models, e.g. CNN, LSTM, GRU, with a patent pending on eye tracking technology.
Implement speaker verification systems on DSP which includes enrollment with MAP adaptation, verification with novel scoring methods, and back-end training pipeline for GMMs.
Reduce memory footprint and speed up runtime for i-Vector speaker recognition system with matrix factorization. Implement average stochastic gradient descent with L2 regularization to train sub-matrices.

Research on deep neural network for brain computer interface, i-Vector, probabilistic linear discriminant analysis, matrix factorization, and DNN for multiple-speaker identification.

Research scientist

SDL

Feb 2012 – Jan 2014 Los Angeles, California

R&D in commercial machine translation systems.

Model adaptation: worked on techniques to automatically adapt background translation system to a specific domain/genre via information retrieval approach and machine learning methods.
Confidence estimation: explored methods for machine translation quality-prediction including SVM and M5P decision tree. Member of the SDL Language Weaver team that won the 2012 MT quality prediction competition.
Reordering models: implemented lexicalized reordering models with distributed Hadoop/Pig training pipeline and real-time decoding.

Skills

Leadership

Guiding teams to reach ambitious goals

Managing cross-organization teams

Ensure seamless teamwork.

Software engineering

Enough to get things done timely

Machine learning

Be able to explain LLMs to a kid

Data munging

Extract gold from dirt

Product development

Turn research ideas to business opportunities

Publications

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

We introduce Phi-4-Mini and Phi-4-Multimodal, compact yet highly capable language and multimodal models. Phi-4-Mini is a …

Abdelrahman Abouelenin, Atabak Ashfaq, Adam Atkinson, Hany Awadalla, Nguyen Bach, Jianmin Bao, Alon Benhaim, Martin Cai, Vishrav Chaudhary, Congcong Chen, Dong Chen, Dongdong Chen, Junkun Chen, Weizhu Chen, Yen-Chun Chen, Yi-ling Chen, Qi Dai, Xiyang Dai, Ruchao Fan, Mei Gao, Min Gao, Amit Garg, Abhishek Goswami, Junheng Hao, Amr Hendy, Yuxuan Hu, Xin Jin, Mahmoud Khademi, Dongwoo Kim, Young Jin Kim, Gina Lee, Jinyu Li, Yunsheng Li, Chen Liang, Xihui Lin, Zeqi Lin, Mengchen Liu, Yang Liu, Gilsinia Lopez, Chong Luo, Piyush Madan, Vadim Mazalov, Ali Mousavi, Anh Nguyen, Jing Pan, Daniel Perez-Becker, Jacob Platin, Thomas Portet, Kai Qin, Bo Ren, Liliang Ren, Sambuddha Roy, Ning Shang, Yelong Shen, Saksham Singhal, Subhojit Som, Xia Song, Tetyana Sych, Praneetha Vaddamanu, Shuohang Wang, Yiming Wang, Zhenghao Wang, Haibin Wu, Haoran Xu, Weijian Xu, Yifan Yang, Ziyi Yang, Dongkun Yu, Ishmam Zabir, Jianwen Zhang, Li Lynn Zhang, Yunan Zhang, Xiren Zhou

PDF Code

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured …

Marah Abdin, Jyoti Aneja, Hany Awadalla, Ahmed Awadallah, Ammar Ahmad Awan, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Jianmin Bao, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Martin Cai, Qin Cai, Vishrav Chaudhary, Dong Chen, Dongdong Chen, Weizhu Chen, Yen-Chun Chen, Yi-Ling Chen, Hao Cheng, Parul Chopra, Xiyang Dai, Matthew Dixon, Ronen Eldan, Victor Fragoso, Chanfeng Gao, Mei Gao, Min Gao, Amit Garg, Allie Del Giorno, Abhishek Goswami, Suriya Gunasekar, Emman Haider, Junheng Hao, Russell J. Hewett, Wenxiang Hu, Jamie Huynh, Dan Iter, Sam Ade Jacobs, Mojan Javaheripi, Xin Jin, Nikos Karampatziakis, Piero Kauffmann, Mahood Khademi, Dongwoo Kim, Young Jin Kim, Lev Kurilenko, James R. Lee, Yin Tat Lee, Yuanzhi Li, Yunsheng Li, Chen Liang, Lars Liden, Xihui Lin, Zeqi Lin, Ce Liu, Liyuan Liu, Mengchen Liu, Weishung Liu, Xiaodong Liu, Chong Luo, Piyush Madan, Ali Mahmoudzadeh, David Majerek, Matt Mazzola, Caio César Teodoro Mendes, Arindam Mitra, Hardik Modi, Anh Nguyen, Brandon Norick, Barun Patra, Daniel Perez-Becker, Thomas Portet, Reid Pryzant, Tieyang Qin, Marko Radmilac, Liliang Ren, Gustavo de Rosa, Corby Rosset, Sambuddha Roy, Olatunji Ruwase, Olli Saarikivi, Amin Saied, Adil Salim, Michael Santacroce, Shital Shah, Ning Shang, Hiteshi Sharma, Yelong Shen, Swadheen Shukla, Xia Song, Masahiro Tanaka, Andrea Tupini, Praneetha Vaddamanu, Chunyu Wang, Guanhua Wang, Lijuan Wang, Shuohang Wang, Xin Wang, Yu Wang, Rachel Ward, Wen Wen, Philipp Witte, Haiping Wu, Xiaoxia Wu, Michael Wyatt, Bin Xiao, Can Xu, Jiahang Xu, Weijian Xu, Jilong Xue, Sonali Yadav, Fan Yang, Jianwei Yang, Zhan Yang, Ziyi Yang, Dongkun Yu, Lu Yuan, Chenruidong Zhang, Cyril Zhang, Jianwen Zhang, Li Lynn Zhang, Yi Zhang, Yue Zhang, Yunan Zhang, Xiren Zhou

PDF Code