Simple Thoughts
All Posts Tags Categories About
Simple Thoughts
Cancel
All PostsTagsCategoriesAbout

All Posts

2025

Time To Think February 4, 2025

2024

Calculating the Cost of a Google Deepmind Paper July 30, 2024
DeepSeek Core Readings 0 - Coder June 30, 2024
DeepSeek Core Readings 1 - LLM June 23, 2024
Basic tips for remaining conscious April 12, 2024

2023

2023 December 31, 2023
Rough thoughts on Mixtral vs Open Source December 13, 2023
Knowing Enough About MoE to Explain Dropped Tokens in GPT-4 August 9, 2023
Non-determinism in GPT-4 is caused by Sparse MoE August 5, 2023
Dumped Blog Ideas July 2, 2023
  • 1
  • 2
  • 3
2022 - 2025 152334H