An Explorable explaining the concept of patchoscopes for an external audience. Patchoscopes is an interpretability tool that allows researchers to better understand an LLMs output representations through natural language experiments.
Learn more about how we research
We maintain a portfolio of research projects, providing individuals and teams the freedom to emphasize specific types of work