Fantastic new blog post by Nicholas Carlini on LLMs memorizing their training data and what it means (and doesn't mean) for copyright: https://nicholas.carlini.com/writing/2025/privacy-copyright-and-generative-models.html
11.3.2025 17:24Fantastic new blog post by Nicholas Carlini on LLMs memorizing their training data and what it means (and doesn't mean) for copyright:...I got annoyed couldn't find a good glossary of DP terms, so I wrote one: https://desfontain.es/blog/differential-privacy-glossary.html ✨
Simple definitions, links to further reading, and some mild snark about how bad we are at naming things sometimes 🙃
10.3.2025 20:16I got annoyed couldn't find a good glossary of DP terms, so I wrote one: https://desfontain.es/blog/differential-privacy-glossary.html...joke spoiler
boba kitty 😺
10.3.2025 17:21joke spoilerboba kitty 😺Tired: bouba/kiki
Wired:
FUCK YES 🥳
Onwards to 2000 now!!! 🚀
10.3.2025 12:55FUCK YES 🥳Onwards to 2000 now!!! 🚀I just noticed that the equations on my blog are now displayed in a weirdly tiny font on Chromium (see e.g. https://desfontain.es/blog/dp-vision.html). I don't get the bug on Firefox. Does it do that for you too? Does anyone know how to fix it? =/
9.3.2025 22:36I just noticed that the equations on my blog are now displayed in a weirdly tiny font on Chromium (see e.g....Strongly recommend the Mille et Une Orchidées exhibition in Paris. It felt like @jerry's personal paradise 🤩
8.3.2025 22:51Strongly recommend the Mille et Une Orchidées exhibition in Paris. It felt like @jerry's personal paradise 🤩Tonight I learned how to sharpen knives at https://www.lorenzimesser.ch/ and I had a great time — Reto is an absolute pro and an excellent teacher. I strongly recommend it 🔪
7.3.2025 21:42Tonight I learned how to sharpen knives at https://www.lorenzimesser.ch/ and I had a great time — Reto is an absolute pro and an excellent...Projet "un gâteau par chapitre du livre de @Owi": Lemon Bling-Bling 🍋, Fraise & Gingembre 🍓🫚, Fol Noix 🌰, et le Brownie Passion qui était tellement bon qu'on a oublié de faire une photo 🍫
Et bah tout est merveilleux hein. 10/10 j'ai hâte des chapitres suivants 🤤
5.3.2025 07:54Projet "un gâteau par chapitre du livre de @Owi": Lemon Bling-Bling 🍋, Fraise & Gingembre 🍓🫚, Fol Noix 🌰, et le...Today's unusual praise for my blog: "I really liked the post about converting your PhD thesis from LaTeX to HTML, especially the whole 'keep going through the pain' vibe of it" 🤔
19.2.2025 15:53Today's unusual praise for my blog: "I really liked the post about converting your PhD thesis from LaTeX to HTML, especially the...Idle question: is there a strong reason why Python, which generally tries to have a super simple syntax for everything, doesn't allow iterators like `for key, value in d` where d is a dict, but instead forces users to write `for key, value in d.items()`?
19.2.2025 12:42Idle question: is there a strong reason why Python, which generally tries to have a super simple syntax for everything, doesn't allow...Friend lamp!!! 😺
https://machinelearning.apple.com/research/elegnt-expressive-functional-movement
Fun idea found on Reddit: Wordle, except you have to find a chess mating position ️♟️
https://www.matle.io/
Nothing motivates me to write more blog posts about synthetic data than synthetic data vendors being defensive in my LinkedIn mentions whenever I say things like "there's no silver bullet" 👀
30.1.2025 08:47Nothing motivates me to write more blog posts about synthetic data than synthetic data vendors being defensive in my LinkedIn mentions...Just published a ✨ new blog post ✨ on synthetic data generation! In which I dismantle the marketing-fueled myth of synthetic data as a silver bullet, and talk about fundamental trade-offs instead 📊
https://www.tmlt.io/resources/fundamental-trilemma-synthetic-data-generation
29.1.2025 19:48Just published a ✨ new blog post ✨ on synthetic data generation! In which I dismantle the marketing-fueled myth of synthetic data as a...This was stressful to write and even more stressful to publish, but I've had to have this conversation too many times with too many different people, and it seems worthwhile to have everything summarized in a single place 🫤
20.1.2025 12:43This was stressful to write and even more stressful to publish, but I've had to have this conversation too many times with too many...I wrote a blog post about Diffprivlib, a well-known differential privacy library.
https://desfontain.es/blog/diffprivlib.html
20.1.2025 12:35I wrote a blog post about Diffprivlib, a well-known differential privacy library.https://desfontain.es/blog/diffprivlib.htmlWeirdly specific gripe: when searching "fireplace" on YouTube, it's annoyingly hard to find atmosphere videos that are real wood fires. A ton of it is AI and much of the rest is gas fires pretending to be wood logs burning 😒
17.1.2025 20:12Weirdly specific gripe: when searching "fireplace" on YouTube, it's annoyingly hard to find atmosphere videos that are real...I'm out there writing blog posts like "an evil adversary could design clever attacks to extract some personal data from LLMs!" and meanwhile Google's Gemini is just leaking the user's home address, unprompted, while playing chess 🫠
https://youtu.be/0a_gYIO47Ac?feature=shared&t=504 🙃
15.1.2025 10:28I'm out there writing blog posts like "an evil adversary could design clever attacks to extract some personal data from LLMs!"...Maybe for the shortform social media audience I should have chosen one of the spicier diagrams as a teaser image 🤔
13.1.2025 15:18Maybe for the shortform social media audience I should have chosen one of the spicier diagrams as a teaser image 🤔