Veritable Tech Blog

Apr 18
How to Reduce the Loading Time of Julia Scripts
Creating and optimizing custom sysimages
Apr 4
[Notes] Gradient Checkpointing with BERT
A brief analysis of huggingface's implementation
Mar 18
[Paper] Adafactor: Adaptive Learning Rates with Sublinear Memory Cost
Essential for fine-tuning T5 v1.1 and mT5 models
Mar 14
Mistake I Made that Crippled My Streamlit App
Not properly caching slows down the app and increases memory consumption
Feb 19
A Case Study of fastcore @patch_to
Trying out SnapMix with minimal changes to the codebase