Posts · Veritable Tech Blog

2023

Jul 16

[Notes] MaxViT: Multi-Axis Vision Transformer

2022

May 15

[Notes] PolyLoss: A Polynomial Expansion Perspective of Classification Loss Functions

Mar 14

[Notes] Understanding Visual Attention Network

Jan 28

[Notes] Understanding ConvNeXt

Jan 7

Use MPIRE to Parallelize PostgreSQL Queries

2021

Aug 27

[Notes] (Ir)Reproducible Machine Learning: A Case Study

Jul 25

[Notes] Understanding XCiT - Part 2

Jul 24

[Notes] Understanding XCiT - Part 1

Jun 13

How to Create a Documentation Website for Your Python Package

May 1

Text Analysis using Julia

Apr 18

How to Reduce the Loading Time of Julia Scripts

Apr 4

[Notes] Gradient Checkpointing with BERT

Mar 18

[Paper] Adafactor: Adaptive Learning Rates with Sublinear Memory Cost

Mar 14

Mistake I Made that Crippled My Streamlit App

Feb 19

A Case Study of fastcore @patch_to

Feb 14

[Paper] Rethinking Cooperative Rationalization: Introspective Extraction and Complement Control

Jan 18

Reducing the SentencePiece Vocabulary Size of Pretrained NLP Models

2020

Dec 28

[Kaggle] Google Research Football 2020

Dec 22

[PyTorch Lightning] Log Training Losses when Accumulating Gradients

Dec 14

Generating Synthetic Tabular Data Using GAN

Dec 4

[Paper] Are We Really Making Much Progress?

Nov 23

Weird Behavior in the FiveThirtyEight 2020 Election Model

Nov 14

The Book Of Why: The New Science of Cause and Effect

Oct 17

Automatically Testing Your SQLite Database with Great Expectations

Oct 11

[TensorFlow] Training CV Models on TPU without Using Cloud Storage

Oct 7

Replicate Conda Environment in Docker

Sep 8

[Paper] Please Stop Permuting Features

Aug 19

[Paper] Language-agnostic BERT Sentence Embedding

Aug 5

[Competition] Jigsaw Multilingual Toxic Comment Classification

Jul 23

[Paper] Training Question Answering Models From Synthetic Data

Jul 11

[Tip] TorchScript Supports Half Precision

Jul 6

Self-Supervised Domain Adaptation

Jun 28

Using Julia to Do Whole Word Masking

Jun 16

[Failure Report] Distill Fine-tuned Transformers into Recurrent Neural Networks

May 4

Deploying EfficientNet Model using TorchServe

Apr 24

TensorFlow Profiler with Custom Training Loop

Apr 10

Monitor Python Script Cron Jobs using Telegram

Mar 31

Clutter-free Interactive Charts in R using Plotly

Feb 13

TensorFlow 2.1 with TPU in Practice

2019

Dec 17

Create a Customized Text Annotation Tool in Two Days - Part 2

Dec 16

Create a Customized Text Annotation Tool in Two Days - Part 1

Nov 28

Fine-tuning BERT for Similarity Search

Nov 9

[Notes] “Statistical Inference Enables Bad Science; Statistical Thinking Enables Good Science”

Oct 25

Pro Tip: Use Shutdown Script to Detect Preemption on GCP

Sep 24

Zero Shot Cross-Lingual Transfer with Multilingual BERT

Aug 22

More Memory-Efficient Swish Activation Function

Aug 14

Customizing spaCy Sentence Segmentation

Aug 4

[Notes] Jigsaw Unintended Bias in Toxicity Classification

Jul 16

[Notes] iMet Collection 2019 - FGVC6 (Part 1)

Jun 25

Dealing with Synthetic Data

Jun 21

Smaller Docker Image using Multi-Stage Build

Jun 13

Mixed Precision Training on Tesla T4 and P100

May 23

[Notes] SHAP Values

Apr 24

Detecting Chinese Characters in Unicode Strings

Apr 9

A First Look at Plotly Express

Apr 4

Custom Image Augmentation with Keras

Mar 30

UMAP on RAPIDS (15x Speedup)

Mar 26

Use NVIDIA Apex for Easy Mixed Precision Training in PyTorch

Feb 15

Multilingual Similarity Search Using Pretrained Bidirectional LSTM Encoder

Feb 10

News Topic Similarity Measure using Pretrained BERT Model

Jan 22

Playing with rstudio/gt R Package

Jan 3

More Portable, Reproducible R Development Environment

2018

Dec 7

Use TextRank to Extract Most Important Sentences in Article

Nov 7

Implementing Beam Search - Part 2

Nov 5

Implementing Beam Search - Part 1

Oct 13

[Notes] Neural Language Models with PyTorch

Sep 25

Use Visual Studio Code To Develop Python Programs

Aug 10

Prepare Deep-Learning-Ready VMs on Google Cloud Platform

Jul 16

Quantile Regression — Part 2

Jul 12

Quantile Regression — Part 1

Mar 24

[Review] Kaggle Toxic Comment Classification Challenge

Feb 27

Analyzing Tweets with R

2017

Oct 28

Feature Importance Measures for Tree Models — Part I

Jul 27

[Learning Note] Single Shot MultiBox Detector with PyTorch — Part 3

Jul 26

[Learning Note] Single Shot MultiBox Detector with PyTorch — Part 2

Jul 24

[Learning Note] Single Shot MultiBox Detector with PyTorch — Part 1

2016

Jul 13

Beijing PM2.5 Concentration History 2015-2016

2015

Aug 29

First Step of Web Scraping in Go

Aug 13

Plotly Example: Deaths Caused By Cancer in Taiwan

Jul 31

Random Sampling Data with Header

Jul 28

Migrated the Blog from Pelican to Hugo

Jul 11

Bayesian Logistic Regression using PyMC3

Jul 7

Change Sources of Ubuntu in a Docker image

Jul 5

Docker: Remove All Untagged Images

Jan 22

Random Sampling at the Command Line

Jan 2

Implement FTRL-Proximal Algorithm in Go - Part 2

2014

Dec 9

Implement FTRL-Proximal Algorithm in Go - Part 1

Nov 29

The Power of PyPy

Apr 29

Tip for Using IPython Notebooks in virtualenv

Mar 5

A simple script to automate MySQLdump backups

Feb 25

Shortcuts for some common statistical functions

2013

Oct 7

Discussing the Zen of Python