sthasushant
Posts
Posts
2024-09-21
A note on 1st and 2nd order optimization
2024-09-14
A Note on Normalization
2024-09-08
A Note on Positional Encoding
2024-09-06
A Note on InfLLM v2: Coarse Memory, Fine Recall
2024-09-05
A note on InfLLM v1
2024-09-01
Amazing Linear Units
2024-08-30
What is more on Flash attention 2 and 3 ?
2024-08-30
A note on Flash Attention