Veritable Tech Blog

Technical Notes from a Data Geek.

  • Apr 18
    How to Reduce the Loading Time of Julia Scripts

    Creating and optimizing custom sysimages

  • Apr 4
    [Notes] Gradient Checkpointing with BERT

    A brief analysis of Hugging Face's implementation

  • Mar 18
    [Paper] Adafactor: Adaptive Learning Rates with Sublinear Memory Cost

    Essential for fine-tuning T5 v1.1 and mT5 models

  • Mar 14
    Mistake I Made that Crippled My Streamlit App

    Improper caching slows down the app and increases memory consumption

  • Feb 19
    A Case Study of fastcore @patch_to

    Trying out SnapMix with minimal changes to the codebase

© Copyright 2021 Ceshine Lee
