←
Home
Archive
Tags
Search
Consulting
Subscribe
Veritable Tech Blog
Technical Notes from a Data Geek.
2023
Jul 16
[Notes] MaxViT: Multi-Axis Vision Transformer
2022
May 15
[Notes] PolyLoss: A Polynomial Expansion Perspective of Classification Loss Functions
Mar 14
[Notes] Understanding Visual Attention Network
Jan 28
[Notes] Understanding ConvNeXt
2021
Jul 25
[Notes] Understanding XCiT - Part 2
Jul 24
[Notes] Understanding XCiT - Part 1
Apr 4
[Notes] Gradient Checkpointing with BERT
Mar 18
[Paper] Adafactor: Adaptive Learning Rates with Sublinear Memory Cost
Mar 14
Mistake I Made that Crippled My Streamlit App
Feb 19
A Case Study of fastcore @patch_to
Feb 14
[Paper] Rethinking Cooperative Rationalization: Introspective Extraction and Complement Control
2020
Dec 28
[Kaggle] Google Research Football 2020
Dec 22
[PyTorch Lightning] Log Training Losses when Accumulating Gradients
Jul 11
[Tip] TorchScript Supports Half Precision
Jul 6
Self-Supervised Domain Adaptation
Jun 16
[Failure Report] Distill Fine-tuned Transformers into Recurrent Neural Networks
May 4
Deploying EfficientNet Model using TorchServe
2019
Nov 28
Fine-tuning BERT for Similarity Search
Sep 24
Zero Shot Cross-Lingual Transfer with Multilingual BERT
Aug 22
More Memory-Efficient Swish Activation Function
Jun 21
Smaller Docker Image using Multi-Stage Build
Jun 13
Mixed Precision Training on Tesla T4 and P100
Mar 26
Use NVIDIA Apex for Easy Mixed Precision Training in PyTorch
Feb 15
Multilingual Similarity Search Using Pretrained Bidirectional LSTM Encoder
Feb 10
News Topic Similarity Measure using Pretrained BERT Model
2018
Oct 13
[Notes] Neural Language Models with PyTorch
2017
Jul 27
[Learning Note] Single Shot MultiBox Detector with Pytorch — Part 3
Jul 26
[Learning Note] Single Shot MultiBox Detector with Pytorch — Part 2
Jul 24
[Learning Note] Single Shot MultiBox Detector with PyTorch — Part 1