May 13, 2025 | I successfully defended my PhD dissertation, “Language Models as Opinion Models: Techniques and Applications,” earlier today! The dissertation is not available online yet, but it will be shortly, along with preprint versions of the new work it contains. |
Apr 26, 2025 | Our paper “Bridging the Data Provenance Gap Across Text, Speech and Video” appeared today at ICLR 2025! |
Jan 22, 2025 | ICLR 2025 has accepted our new paper “Bridging the Data Provenance Gap Across Text, Speech and Video”! This paper is the third phase of work in the Data Provenance Initiative. |
Dec 11, 2024 | The latest Data Provenance Initiative paper, “Consent in Crisis: The Rapid Decline of the AI Data Commons”, appeared today at NeurIPS 2024. |
Nov 12, 2024 | Our paper “On the Relationship between Truth and Political Bias in Language Models” appeared as a main-conference poster at EMNLP 2024! |
Nov 12, 2024 | Our demo “AudienceView: AI-Assisted Interpretation of Audience Feedback in Journalism” appeared at CSCW 2024 in Costa Rica! |
Oct 26, 2024 | I presented our work “The speed of news in Twitter (X) versus radio” at this year’s C+J Symposium. |
Sep 26, 2024 | NeurIPS 2024 has accepted our paper “Consent in Crisis: The Rapid Decline of the AI Data Commons”! |
Sep 20, 2024 | Our new paper “On the Relationship between Truth and Political Bias in Language Models” has been accepted for a main-conference presentation at EMNLP 2024! |
Aug 15, 2024 | I presented our work “ConGraT: Self-Supervised Contrastive Pretraining for Joint Graph and Text Embeddings” at TextGraphs 2024, hosted this year at ACL in Bangkok. |
Jul 18, 2024 | Our paper “The speed of news in Twitter (X) versus radio” was presented at this year’s IC2S2! Check out a recorded version of the talk here. |
Jul 10, 2024 | Our paper “A Large-Scale Audit of Dataset Licensing and Attribution in AI” has been accepted at Nature Machine Intelligence! This paper represents the first phase of work on the Data Provenance Initiative. |
Jul 02, 2024 | CSCW 2024 has accepted our paper “Bridging Dictionary: AI-Generated Dictionary of Partisan Language Use.” |
Jul 02, 2024 | Our new paper, “AudienceView: AI-Assisted Interpretation of Audience Feedback in Journalism,” has been accepted at CSCW 2024! Check out the deployed demo at frontline.ccc-mit.org. |
Jun 17, 2024 | New accepted paper! Our work “ConGraT: Self-Supervised Contrastive Pretraining for Joint Graph and Text Embeddings” will appear at TextGraphs 2024, hosted this year at ACL. |
Jun 06, 2024 | Our position paper, “Data Authenticity, Consent, & Provenance for AI are all broken: what will it take to fix them?” has been selected as a spotlight paper (top 3%) at ICML 2024! |
May 10, 2024 | Our paper “The speed of news in Twitter (X) versus radio” has been accepted at Scientific Reports! |
May 01, 2024 | New paper! ICML 2024 has accepted our paper “Data Authenticity, Consent, & Provenance for AI are all broken: what will it take to fix them?” |
Oct 30, 2023 | Our entry in MIT’s IGNITE Generative AI Entrepreneurship Competition placed as a finalist, winning a $5,000 award. |
Sep 01, 2023 | MIT has selected our work on the Data Provenance Initiative for a Generative AI Impact Award. The award includes $70,000 of research funding. |
Jul 11, 2023 | Our paper “Dubbing in Practice” was presented at this year’s ACL. |
Dec 13, 2022 | TACL has accepted our paper “Dubbing in Practice: A Large-Scale Study of Human Localization With Insights for Automatic Dubbing”! |