Veritable Tech Blog
Technical Notes from a Data Geek.
Reducing the SentencePiece Vocabulary Size of Pretrained NLP Models
Useful for fine-tuning on a subset of available languages
[Kaggle] Google Research Football 2020
Describing my 16th place solution and also reviewing some of the others'
[PyTorch Lightning] Log Training Losses when Accumulating Gradients
The global step is not what you think it is
Generating Synthetic Tabular Data Using GAN
A case study: detecting credit fraud
[Paper] Are We Really Making Much Progress?
A Worrying Analysis of Recent Neural Recommendation Approaches