1. | | Diffusion models are evolutionary algorithms (gonzoml.substack.com) |
|
123 points by che_shr_cat 6 days ago | past | 25 comments
|
2. | | Make Softmax Great Again (gonzoml.substack.com) |
|
2 points by che_shr_cat 9 days ago | past | discuss
|
3. | | Deep Learning Frameworks: The Fourth Pillar of Deep Learning Revolution (gonzoml.substack.com) |
|
1 point by che_shr_cat 11 days ago | past | discuss
|
4. | | TextGrad: Automatic "Differentiation" via Text (gonzoml.substack.com) |
|
3 points by che_shr_cat 4 months ago | past
|
5. | | Superconducting Supercomputers (gonzoml.substack.com) |
|
1 point by che_shr_cat 4 months ago | past
|
6. | | Decoder-decoder architecture is coming (gonzoml.substack.com) |
|
2 points by che_shr_cat 5 months ago | past
|
7. | | Chronos: Using Pretrained LLMs for Probabilistic Time Series Forecasting (gonzoml.substack.com) |
|
2 points by che_shr_cat 6 months ago | past
|
8. | | Big Post About Big Context (gonzoml.substack.com) |
|
49 points by che_shr_cat 8 months ago | past | 19 comments
|
9. | | Neural Network Diffusion (gonzoml.substack.com) |
|
1 point by che_shr_cat 8 months ago | past
|
10. | | Thermodynamic AI is getting hotter (gonzoml.substack.com) |
|
51 points by che_shr_cat 9 months ago | past | 5 comments
|
11. | | Training LLMs with AMD GPUs on Frontier Supercomputer (gonzoml.substack.com) |
|
1 point by che_shr_cat 10 months ago | past
|
12. | | Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws (gonzoml.substack.com) |
|
1 point by che_shr_cat 10 months ago | past
|
13. | | Project CETI (gonzoml.substack.com) |
|
2 points by che_shr_cat 11 months ago | past
|
14. | | GonzoML on Mamba and S6 (+previous post on S4) (gonzoml.substack.com) |
|
1 point by che_shr_cat 11 months ago | past
|
15. | | Conway's Game of Life Is Omniperiodic (gonzoml.substack.com) |
|
2 points by che_shr_cat 11 months ago | past | 1 comment
|
16. | | GonzoML on Gemini (gonzoml.substack.com) |
|
2 points by che_shr_cat 11 months ago | past
|
17. | | Matryoshka Representation Learning (gonzoml.substack.com) |
|
2 points by che_shr_cat on Nov 3, 2023 | past
|
18. | | Mindstorms in Natural Language-Based Societies of Mind (gonzoml.substack.com) |
|
2 points by che_shr_cat on Oct 29, 2023 | past
|
19. | | The convolution empire strikes back (gonzoml.substack.com) |
|
132 points by che_shr_cat on Oct 27, 2023 | past | 56 comments
|
20. | | Sparse Universal Transformer (gonzoml.substack.com) |
|
3 points by che_shr_cat on Oct 23, 2023 | past
|
21. | | MemWalker: An alternative way for working with long documents using transformers (gonzoml.substack.com) |
|
1 point by che_shr_cat on Oct 17, 2023 | past
|
22. | | "Building Machines That Learn and Think Like People", 7 Years Later (gonzoml.substack.com) |
|
106 points by che_shr_cat on Oct 13, 2023 | past | 40 comments
|
23. | | Chain-of-Thought → Tree-of-Thought (gonzoml.substack.com) |
|
1 point by che_shr_cat on Oct 10, 2023 | past
|
24. | | Mortal Computers (gonzoml.substack.com) |
|
31 points by che_shr_cat on Oct 9, 2023 | past | 1 comment
|
25. | | Levanter – Legible, Scalable, Reproducible Foundation Models with Jax (stanford.edu) |
|
1 point by che_shr_cat on June 20, 2023 | past
|
26. | | LM-3 – resurrecting the MIT CADR (tumbleweed.nu) |
|
1 point by che_shr_cat on May 12, 2023 | past
|
27. | | The Annotated Diffusion Model (huggingface.co) |
|
1 point by che_shr_cat on June 13, 2022 | past
|
28. | | Road to text-guided image generation: DALL·E, CLIP, GLIDE, DALL·E 2 (unCLIP) (inten.to) |
|
2 points by che_shr_cat on May 6, 2022 | past
|
29. | | Self-replicating radiation-shield for deep-space exploration: Radiotrophic fungi (biorxiv.org) |
|
197 points by che_shr_cat on Jan 15, 2022 | past | 83 comments
|
30. | | Revisiting ‘Powers of Ten’ – what we’ve learned about the Universe since 1977 (aeon.co) |
|
3 points by che_shr_cat on Jan 15, 2022 | past
|