We advance the state of the art in natural language technologies and build systems that learn to understand and generate language in context.
We advance the state of the art in natural language technologies and build systems that learn to understand and generate language in context.
Our team comprises multiple research groups working on a wide range of natural language understanding and generation projects. We pursue long-term research to develop novel capabilities that can address the needs of current and future Google products. We publish frequently and evaluate our methods on established scientific benchmarks (e.g., SQuAD, GLUE, SuperGlue) or develop new ones for measuring progress (e.g., Conceptual Captions, Natural Questions, TyDiQA). We collaborate with other teams across Google to deploy our research to the benefit of our users. Our product contributions often stretch the boundaries of what is technically possible. Applications of our research have resulted in better language capabilities across all major Google products.
Our researchers are experts in natural language processing and machine learning with varied backgrounds and a passion for language. Computer scientists and linguists work hand-in-hand to provide insight into ways to define language tasks, collect valuable data, and assist in enabling internationalization. Researchers and engineers work together to develop new neural network models that are sensitive to the nuances of language while taking advantage of the latest advances in specialized compute hardware (e.g., TPUs) to produce scalable solutions that can be used by billions of users.
Learn contextual language representations that capture meaning at various levels of granularity and are transferable across tasks.
Learn end-to-end models for real world question answering that requires complex reasoning about concepts, entities, relations, and causality in the world.
Learn document representations from geometric features and spatial relations, multi-modal content features, syntactic, semantic and pragmatic signals.
Advance next generation dialogue systems in human-machine and multi-human-machine interactions to achieve natural user interactions and enrich conversations between human users.
Produce natural and fluent output for spoken and written text for different domains and styles.
Learning high-quality models that scale to all languages and locales and are robust to multilingual inputs, transliterations, and regional variants.
Understand visual inputs (image & video) and express that understanding using fluent natural language (phrases, sentences, paragraphs).
Use state-of-the-art machine learning techniques and large-scale infrastructure to break language barriers and offer human quality translations across many languages to make it possible to easily explore the multilingual world.
Learn to summarize single and multiple documents into cohesive and concise summaries that accurately represent the documents.
Learn end-to-end models that classify the semantics of text, such as topic, sentiment or sensitive content (i.e., offensive, inappropriate, or controversial content).
Represent, combine, and optimize models for speech to text and text to speech.
Learn models that infer entities (people, places, things) from text and that can perform reasoning based on their relationships.
Use and learn representations that span language and other modalities, such as vision, space and time, and adapt and use them for problems requiring language-conditioned action in real or simulated environments (i.e., vision-and-language navigation).
Learn models for predicting executable logical forms given text in varying domains and languages, situated within diverse task contexts.
Learn models that can detect sentiment attribution and changes in narrative, conversation, and other text or spoken scenarios.
Learn models of language that are predictable and understandable, perform well across the broadest possible range of linguistic settings and applications, and adhere to our principles of responsible practices in AI.
Ankur Parikh
Bo Pang
Colin Cherry
Dipanjan Das
Fernando Pereira
Filip Radlinski
Jacob Eisenstein
Jason Baldridge
Katja Filippova
Kellie Webster
Kenton Lee
Kristina N Toutanova
Luheng He
Massimiliano Ciaramita
Melvin Johnson
Michael Collins
Orhan Firat
Radu Soricut
Slav Petrov
Srini Narayanan
Tania Bedrax-Weiss
William W. Cohen
Wolfgang Macherey
Yasemin Altun
Yun-hsuan Sung