←
Home
Archive
Tags
Search
Consulting
Subscribe
Veritable Tech Blog
Technical Notes from a Data Geek.
2023
Jul 16
[Notes] MaxViT: Multi-Axis Vision Transformer
2022
May 15
[Notes] PolyLoss: A Polynomial Expansion Perspective of Classification Loss Functions
Mar 14
[Notes] Understanding Visual Attention Network
Jan 28
[Notes] Understanding ConvNeXt
Jan 7
Use MPIRE to Parallelize PostgreSQL Queries
2021
Aug 27
[Notes] (Ir)Reproducible Machine Learning: A Case Study
Jul 25
[Notes] Understanding XCiT - Part 2
Jul 24
[Notes] Understanding XCiT - Part 1
Jun 13
How to Create a Documentation Website for Your Python Package
May 1
Text Analysis using Julia
Apr 18
How to Reduce the Loading Time of Julia Scripts
Apr 4
[Notes] Gradient Checkpointing with BERT
Mar 18
[Paper] Adafactor: Adaptive Learning Rates with Sublinear Memory Cost
Mar 14
Mistake I Made that Crippled My Streamlit App
Feb 19
A Case Study of fastcore @patch_to
Feb 14
[Paper] Rethinking Cooperative Rationalization: Introspective Extraction and Complement Control
Jan 18
Reducing the SentencePiece Vocabulary Size of Pretrained NLP Models
2020
Dec 28
[Kaggle] Google Research Football 2020
Dec 22
[PyTorch Lightning] Log Training Losses when Accumulating Gradients
Dec 14
Generating Synthetic Tabular Data Using GAN
Dec 4
[Paper] Are We Really Making Much Progress?
Nov 23
Weird Behavior in the FiveThirtyEight 2020 Election Model
Nov 14
The Book Of Why: The New Science of Cause and Effect
Oct 17
Automatic Testing Your SQLite Database with Great Expectations
Oct 11
[Tensorflow] Training CV Models on TPU without Using Cloud Storage
Oct 7
Replicate Conda Environment in Docker
Sep 8
[Paper] Please Stop Permuting Features
Aug 19
[Paper] Language-agnostic BERT Sentence Embedding
Aug 5
[Competition] Jigsaw Multilingual Toxic Comment Classification
Jul 23
[Paper] Training Question Answering Models From Synthetic Data
Jul 11
[Tip] TorchScript Supports Half Precision
Jul 6
Self-Supervised Domain Adaptation
Jun 28
Using Julia to Do Whole Word Masking
Jun 16
[Failure Report] Distill Fine-tuned Transformers into Recurrent Neural Networks
May 4
Deploying EfficientNet Model using TorchServe
Apr 24
Tensorflow Profiler with Custom Training Loop
Apr 10
Monitor Python Script Cron Jobs using Telegram
Mar 31
Clutter-free Interactive Charts in R using Plotly
Feb 13
TensorFlow 2.1 with TPU in Practice
2019
Dec 17
Create a Customized Text Annotation Tool in Two Days - Part 2
Dec 16
Create a Customized Text Annotation Tool in Two Days - Part 1
Nov 28
Fine-tuning BERT for Similarity Search
Nov 9
[Notes] “Statistical Inference Enables Bad Science; Statistical Thinking Enables Good Science”
Oct 25
Pro Tip: Use Shutdown Script Detect Preemption on GCP
Sep 24
Zero Shot Cross-Lingual Transfer with Multilingual BERT
Aug 22
More Memory-Efficient Swish Activation Function
Aug 14
Customizing Spacy Sentence Segmentation
Aug 4
[Notes] Jigsaw Unintended Bias in Toxicity Classification
Jul 16
[Notes] iMet Collection 2019 - FGVC6 (Part 1)
Jun 25
Dealing with Synthetic Data
Jun 21
Smaller Docker Image using Multi-Stage Build
Jun 13
Mixed Precision Training on Tesla T4 and P100
May 23
[Notes] SHAP Values
Apr 24
Detecting Chinese Characters in Unicode Strings
Apr 9
A First Look at Plotly Express
Apr 4
Custom Image Augmentation with Keras
Mar 30
UMAP on RAPIDS (15x Speedup)
Mar 26
Use NVIDIA Apex for Easy Mixed Precision Training in PyTorch
Feb 15
Multilingual Similarity Search Using Pretrained Bidirectional LSTM Encoder
Feb 10
News Topic Similarity Measure using Pretrained BERT Model
Jan 22
Playing with rstudio/gt R Package
Jan 3
More Portable, Reproducible R Development Environment
2018
Dec 7
Use TextRank to Extract Most Important Sentences in Article
Nov 7
Implementing Beam Search - Part 2
Nov 5
Implementing Beam Search - Part 1
Oct 13
[Notes] Neural Language Models with PyTorch
Sep 25
Use Visual Studio Code To Develop Python Programs
Aug 10
Prepare Deep-Learning-Ready VMs on Google Cloud Platform
Jul 16
Quantile Regression — Part 2
Jul 12
Quantile Regression — Part 1
Mar 24
[Review] Kaggle Toxic Comment Classification Challenge
Feb 27
Analyzing Tweets with R
2017
Oct 28
Feature Importance Measures for Tree Models — Part I
Jul 27
[Learning Note] Single Shot MultiBox Detector with Pytorch — Part 3
Jul 26
[Learning Note] Single Shot MultiBox Detector with Pytorch — Part 2
Jul 24
[Learning Note] Single Shot MultiBox Detector with PyTorch — Part 1
2016
Jul 13
Beijing PM2.5 Concentration History 2015-2016
2015
Aug 29
First Step of Web Scraping in Go
Aug 13
Plotly Example: Deaths Caused By Cancer in Taiwan
Jul 31
Random Sampling Data with Header
Jul 28
Migrated the Blog from Pelican to Hugo
Jul 11
Bayesian Logistic Regression using PyMC3
Jul 7
Change Sources of Ubuntu in a Docker image
Jul 5
Docker: Remove All Untagged Images
Jan 22
Random Sampling at the Command Line
Jan 2
Implement FTRL-Proximal Algorithm in Go - Part 2
2014
Dec 9
Implement FTRL-Proximal Algorithm in Go - Part 1
Nov 29
The Power of PyPy
Apr 29
Tip for using iPython Notebooks in virtualenv
Mar 5
A simple script to automate MySQLdump backups
Feb 25
Shortcuts for some common statistical functions
2013
Oct 7
Dicussing the zen of python