Xuedong Huang | |
---|---|
Born | |
Citizenship | United States |
Alma mater | University of Edinburgh, Tsinghua University, Hunan University |
Awards | National Academy of Engineering Member, American Academy of Arts and Sciences Member, IEEE Bose Industrial Leader Award, Asian American Corporate Leadership Award, ACM Fellow, IEEE Fellow |
Scientific career | |
Fields | Speech Recognition, Machine translation, Natural Language Processing, AI, Computer Vision, Software Development |
Institutions | Zoom Video Communications, Microsoft, Carnegie Mellon University |
Xuedong David Huang (born October 20, 1962) is a Chinese American computer scientist and technology executive who has made contributions to spoken language processing and artificial intelligence, including Azure AI Services. He is Zoom's chief technology officer after serving as Microsoft's Technical Fellow and Azure AI Chief Technology Officer for 30 years. Huang is a strong advocate of AI for Accessibility,[1] and AI for Cultural Heritage.[2]
Education
Huang received his PhD from the University of Edinburgh in 1989 (sponsored by the British ORS and Edinburgh University Scholarship), his MS from Tsinghua University in 1984, and BS from Hunan University in 1982.
Career
After receiving his PhD in 1989, Huang joined Carnegie Mellon University and worked with Raj Reddy and Kai-Fu Lee on speech recognition. At CMU, he directed the Sphinx-II speech system research which achieved the best performance in every category of DARPA's 1992 benchmarking. Microsoft Research recruited him to found and lead Microsoft's spoken language initiatives in 1993. His co-authored book Spoken Language Processing[3] and his Historical speech recognition review[4] succinctly summarize several generations of spoken language research. As Microsoft's Mr. Speech for three decades, Huang has been instrumental in creating Microsoft's Speech Application Programming Interface (SAPI), shipping Microsoft Speech Server, and modernizing spoken language and integrative AI services [5][6] via Azure AI,[7] which not only enables millions of 3rd party customers but also powers up Microsoft's Windows, Office, Teams, and Azure OpenAI Services.
Huang helped Microsoft and Azure Cognitive Services achieve multiple industry's first human parity milestones on the following open research tasks: transcribing conversational speech,[8] machine translation,[9] conversational QnA,[10] and computer vision image captioning.[11]
Huang has made significant contributions to the software and AI industry through his executive leadership and his scientific publications, owning more than 170 US patents and impacting billions through Azure AI enabled products and services. In 2016, Wired magazine named him one of 25 Geniuses.[12] In 2021, Azure AI was named the winner of InfoWorld's Technology of the Year Award.[13]
Huang was awarded the Allen Newell research excellence medal in 1992, and IEEE Speech Processing Best Paper in 1993. He was recognized as an IEEE Fellow by Institute of Electrical and Electronics Engineers in 2000, named ACM Fellow by Association for Computing Machinery in 2017, [14] and a member of Washington State Academy of Sciences. Huang received 2022 Asian American Corporate Leadership Award, and IEEE Amar Bose Industrial Leader Award. In 2023, he was elected a member of the US National Academy of Engineering (NAE),[15] and a member of the American Academy of Arts and Sciences.[16]
References
- ↑ "Azure AI for Accessibility". www.linkedin.com. Retrieved 2021-02-09.
- ↑ "Xuedong Huang on LinkedIn: Microsoft Introduces Inuktitut to Microsoft Translator - Microsoft". www.linkedin.com. Retrieved 2021-02-09.
- ↑ Spoken Language Processing, Prentice Hall 2001 Xuedong Huang, Alex Acero, and Hsiao-Wuen Hon
- ↑ A Historical Perspective of Speech Recognition Xuedong Huang, James Baker, Raj Reddy. Communications of the ACM, January 2014, Vol. 57 No. 1, Pages 94-103.
- ↑ Stanford's Speech Transcription Bias Study in 2020
- ↑ XYZ-Code: A holistic representation toward integrative AI, Microsoft AI blog
- ↑ Azure AI Cognitive Services
- ↑ Historic Achievement: Microsoft researchers reach human parity in conversational speech recognition October 18, 2016 | Allison Linn
- ↑ Microsoft reaches a historic milestone, using AI to match human performance in translating news from Chinese to English March 14, 2018 | Allison Linn
- ↑ Machine Reading Systems Are Becoming More Conversational May 2019
- ↑ What’s that? Microsoft’s latest breakthrough, now in Azure AI, describes images as well as people do Oct 14, 2020 | John Roach
- ↑ 25 Geniuses Who Are Creating the Future of Business 04.26.2016
- ↑ Yegulalp, James R. Borck, Martin Heller, Steven Nuñez, Andrew C. Oliver, Ian Pointer, Isaac Sacolick and Serdar (2021-02-03). "InfoWorld's 2021 Technology of the Year Award winners". InfoWorld. Retrieved 2021-02-08.
{{cite web}}
: CS1 maint: multiple names: authors list (link) - ↑ People of ACM - Xuedong Huang July 25, 2017
- ↑ National Academy of Engineering Elects 106 Members and 18 International Members Feb 7, 2023
- ↑ New Members Elected in 2023: American Academy of Arts and Sciences April 19, 2023 Huang joined Zoom at June, 2023 as CTO.