News by mc@sigmoid.social

Posts Subscribe

https://mcognetta.github.io/posts/leetcode-random-seed/

https://sigmoid.social/@mc/11403...

https://mcognetta.github.io/posts/leetcode-random-seed/

20.2.2025 05:17https://mcognetta.github.io/posts/leetcode-random-seed/
https://sigmoid.social/@mc/11403...

This problem came up again, so I updated my old solution!https://sigmoid.social/@mc/111662581917469235

https://sigmoid.social/@mc/11403...

This problem came up again, so I updated my old solution!

https://sigmoid.social/@mc/111662581917469235

20.2.2025 05:17This problem came up again, so I updated my old solution!https://sigmoid.social/@mc/111662581917469235
https://sigmoid.social/@mc/11403...

https://sigmoid.social/@mc/11398...

11.2.2025 08:14
https://sigmoid.social/@mc/11398...

Tokenization is an often-overlooked aspect of modern #NLP, but it’s experiencing a resurgence — thanks in large part to @karpathy and...

https://sigmoid.social/@mc/11398...

Tokenization is an often-overlooked aspect of modern #NLP, but it’s experiencing a resurgence — thanks in large part to @karpathy and his classic tweet:

x.com/karpathy/sta...

Come hang out with us and let's fix these problems!

11.2.2025 08:14Tokenization is an often-overlooked aspect of modern #NLP, but it’s experiencing a resurgence — thanks in large part to @karpathy and...
https://sigmoid.social/@mc/11398...

Today we are launching a server dedicated to Tokenization research! Come join us!discord.gg/CDJhnSvU#nlproc #machinelearning #tokenization

https://sigmoid.social/@mc/11398...

Today we are launching a server dedicated to Tokenization research! Come join us!

discord.gg/CDJhnSvU

#nlproc #machinelearning #tokenization

11.2.2025 08:13Today we are launching a server dedicated to Tokenization research! Come join us!discord.gg/CDJhnSvU#nlproc #machinelearning #tokenization
https://sigmoid.social/@mc/11398...

Geniushttps://youtu.be/TUtafoC4-7k#chess #programming

https://sigmoid.social/@mc/11340...

Genius

https://youtu.be/TUtafoC4-7k

#chess #programming

31.10.2024 14:15Geniushttps://youtu.be/TUtafoC4-7k#chess #programming
https://sigmoid.social/@mc/11340...

Gboard never stops innovating.https://youtu.be/EHqPrHTN1dU#keyboard #jp

https://sigmoid.social/@mc/11325...

Gboard never stops innovating.

https://youtu.be/EHqPrHTN1dU

#keyboard #jp

4.10.2024 23:56Gboard never stops innovating.https://youtu.be/EHqPrHTN1dU#keyboard #jp
https://sigmoid.social/@mc/11325...

It's frustrating that you can't use the walrus operator in list comprehensions where you can in the unrolled loop.Not that I would...

https://sigmoid.social/@mc/11271...

It's frustrating that you can't use the walrus operator in list comprehensions where you can in the unrolled loop.

Not that I would do it often in real code, it's just annoying for when I want to golf.

#python

2.7.2024 07:55It's frustrating that you can't use the walrus operator in list comprehensions where you can in the unrolled loop.Not that I would...
https://sigmoid.social/@mc/11271...

(Another) another day, another Japanese karaoke Korean keyboard variant. Just how many of these are there?This is like the third variant...

https://sigmoid.social/@mc/11256...

(Another) another day, another Japanese karaoke Korean keyboard variant. Just how many of these are there?

This is like the third variant I've seen at the same karaoke chain. Otherwise the UI has been mostly the same across different locations.

#korean #keyboard

6.6.2024 02:21(Another) another day, another Japanese karaoke Korean keyboard variant. Just how many of these are there?This is like the third variant...
https://sigmoid.social/@mc/11256...

I can't be stopped

https://sigmoid.social/@mc/11244...

I can't be stopped

16.5.2024 05:33I can't be stopped
https://sigmoid.social/@mc/11244...

I spent a bit of time last night and this morning over-optimizing a naive #Python #LeetCode solution to get the fastest solution on the...

https://sigmoid.social/@mc/11244...

I spent a bit of time last night and this morning over-optimizing a naive #Python #LeetCode solution to get the fastest solution on the site.

Enjoy: https://theoreticallygoodwithcomputers.com/posts/leetcode-gold-optimization/

15.5.2024 16:30I spent a bit of time last night and this morning over-optimizing a naive #Python #LeetCode solution to get the fastest solution on the...
https://sigmoid.social/@mc/11244...

Starting in 5 minutes!

https://sigmoid.social/@mc/11215...

Starting in 5 minutes!

25.3.2024 14:56Starting in 5 minutes!
https://sigmoid.social/@mc/11215...

Today in the Formal Languages and Neural Networks (FLaNN) seminar, we have Nur Lan presenting `Minimum Description Length Recurrent Neural...

https://sigmoid.social/@mc/11215...

Today in the Formal Languages and Neural Networks (FLaNN) seminar, we have Nur Lan presenting `Minimum Description Length Recurrent Neural Networks`

https://aclanthology.org/2022.tacl-1.45/

Starts at 3PM GMT (Monday)!

https://flann.super.site/

#ML #Interpretability #NeuralNetworks

25.3.2024 13:33Today in the Formal Languages and Neural Networks (FLaNN) seminar, we have Nur Lan presenting `Minimum Description Length Recurrent Neural...
https://sigmoid.social/@mc/11215...

The Elements of Differentiable Programminghttps://arxiv.org/abs/2403.14606Looks like an incredible resource.#ML

https://sigmoid.social/@mc/11213...

The Elements of Differentiable Programming

https://arxiv.org/abs/2403.14606

Looks like an incredible resource.

#ML

22.3.2024 13:57The Elements of Differentiable Programminghttps://arxiv.org/abs/2403.14606Looks like an incredible resource.#ML
https://sigmoid.social/@mc/11213...

As discussed in the original paper, Rényi Efficiency does a pretty good job of predicting relative downstream performance, but there were...

https://sigmoid.social/@mc/11198...

As discussed in the original paper, Rényi Efficiency does a pretty good job of predicting relative downstream performance, but there were effects that it could not capture. Our paper gives explicit examples of some of those effects.

However, it doesn't mean that Rényi Efficiency is useless. One of our counterexamples is very unnatural and the other is also probably not useful IRL. That Rényi Efficiency fails on these isn't necessarily indicative of how it fares on other realistic tokenizers.

24.2.2024 05:55As discussed in the original paper, Rényi Efficiency does a pretty good job of predicting relative downstream performance, but there were...
https://sigmoid.social/@mc/11198...

The original paper was written by our second author (also found here https://twitter.com/zouharvi)....

https://sigmoid.social/@mc/11198...

The original paper was written by our second author (also found here https://twitter.com/zouharvi).

https://aclanthology.org/2023.acl-long.284/

The goal was to find which metrics best correlated with downstream performance (e.g. BLEU).

Our counterexamples allow us to arbitrarily increase Rényi Efficiency while _decreasing_ BLEU.

It turns out these counterexamples also break other popular metrics (e.g. average tokenized sequence length).

24.2.2024 05:52The original paper was written by our second author (also found here https://twitter.com/zouharvi)....
https://sigmoid.social/@mc/11198...

My recent paper, Two Counterexamples to Tokenization and the Noiseless Channel, was accepted to LREC-COLING!We look at a recent metric,...

https://sigmoid.social/@mc/11198...

My recent paper, Two Counterexamples to *Tokenization and the Noiseless Channel*, was accepted to LREC-COLING!

We look at a recent metric, Rényi Efficiency, for *intrinsically* evaluating tokenizers. That is, how can we determine if a tokenizer will be good without having to train a full model (or "metric goes up implies BLEU goes up")?

We design two tokenizer families that break the metric.

https://arxiv.org/abs/2402.14614

Joint work with @zouharvi @s and https://twitter.com/chokkanorg

#nlproc #NLP

24.2.2024 05:50My recent paper, Two Counterexamples to *Tokenization and the Noiseless Channel*, was accepted to LREC-COLING!We look at a recent metric,...
https://sigmoid.social/@mc/11198...

Some #HPC and #Julia internships at the Swiss National Supercomputing Center:https://jobs.ethz.ch/job/view/JOPG_ethz_sRMTIUMhsyaWMB4LVv

https://sigmoid.social/@mc/11194...

Some #HPC and #Julia internships at the Swiss National Supercomputing Center:

https://jobs.ethz.ch/job/view/JOPG_ethz_sRMTIUMhsyaWMB4LVv

16.2.2024 11:33Some #HPC and #Julia internships at the Swiss National Supercomputing Center:https://jobs.ethz.ch/job/view/JOPG_ethz_sRMTIUMhsyaWMB4LVv
https://sigmoid.social/@mc/11194...

This month in #Julia https://discourse.julialang.org/t/this-month-in-julia-world-2024-01/109549#JuliaLang

https://sigmoid.social/@mc/11185...

This month in #Julia

https://discourse.julialang.org/t/this-month-in-julia-world-2024-01/109549

#JuliaLang

1.2.2024 05:22This month in #Julia https://discourse.julialang.org/t/this-month-in-julia-world-2024-01/109549#JuliaLang
https://sigmoid.social/@mc/11185...

A really nice list of Gaussian Splatting research and implementations: https://huggingface.co/spaces/dylanebert/research-trackerCompiled by...

https://sigmoid.social/@mc/11184...

A really nice list of Gaussian Splatting research and implementations:

https://huggingface.co/spaces/dylanebert/research-tracker

Compiled by https://twitter.com/dylan_ebert_ from #HuggingFace

#GaussianSplatting #ML

30.1.2024 07:06A really nice list of Gaussian Splatting research and implementations: https://huggingface.co/spaces/dylanebert/research-trackerCompiled by...
https://sigmoid.social/@mc/11184...

mc@sigmoid.social

mc - Network

https://mcognetta.github.io/posts/leetcode-random-seed/

This problem came up again, so I updated my old solution!https://sigmoid.social/@mc/111662581917469235

Tokenization is an often-overlooked aspect of modern #NLP, but it’s experiencing a resurgence — thanks in large part to @karpathy and...

Today we are launching a server dedicated to Tokenization research! Come join us!discord.gg/CDJhnSvU#nlproc #machinelearning #tokenization

Geniushttps://youtu.be/TUtafoC4-7k#chess #programming

Gboard never stops innovating.https://youtu.be/EHqPrHTN1dU#keyboard #jp

It's frustrating that you can't use the walrus operator in list comprehensions where you can in the unrolled loop.Not that I would...

(Another) another day, another Japanese karaoke Korean keyboard variant. Just how many of these are there?This is like the third variant...

I can't be stopped

I spent a bit of time last night and this morning over-optimizing a naive #Python #LeetCode solution to get the fastest solution on the...

Starting in 5 minutes!

Today in the Formal Languages and Neural Networks (FLaNN) seminar, we have Nur Lan presenting `Minimum Description Length Recurrent Neural...

The Elements of Differentiable Programminghttps://arxiv.org/abs/2403.14606Looks like an incredible resource.#ML

As discussed in the original paper, Rényi Efficiency does a pretty good job of predicting relative downstream performance, but there were...

The original paper was written by our second author (also found here https://twitter.com/zouharvi)....

My recent paper, Two Counterexamples to *Tokenization and the Noiseless Channel*, was accepted to LREC-COLING!We look at a recent metric,...

Some #HPC and #Julia internships at the Swiss National Supercomputing Center:https://jobs.ethz.ch/job/view/JOPG_ethz_sRMTIUMhsyaWMB4LVv

This month in #Julia https://discourse.julialang.org/t/this-month-in-julia-world-2024-01/109549#JuliaLang

A really nice list of Gaussian Splatting research and implementations: https://huggingface.co/spaces/dylanebert/research-trackerCompiled by...

My recent paper, Two Counterexamples to Tokenization and the Noiseless Channel, was accepted to LREC-COLING!We look at a recent metric,...