AI
an archive of posts in this category
| Aug 19, 2025 | Diffusion series - DDPM model |
|---|---|
| Mar 20, 2025 | Understanding Attention and Multi-Head Attention - From Basics to RoPE Optimization |
| Oct 10, 2024 | Grokking - a possible way to achieve AGI |
| Sep 10, 2024 | Grokking - a possible way to achieve AGI |
| Aug 24, 2024 | History of Position Encoding |
| Aug 13, 2024 | LoRA - a potential parameter efficient fine-tuning method for large models |