Related Articles
Tokenizers: How machines read
We will cover often-overlooked concepts vital to NLP, such as Byte Pair Encoding, and discuss how understanding them leads to better models.
Statistical Experiments With Resampling
Bootstrapping and permutation tests Image by Mollyroselee from Pixabay Introduction Most people working with data make observations and then wonder whether these observations are statistically significant.…
PyMC Open Source Development
In this episode of Open Source Directions, we were joined by Thomas Wiecki once again who talked about the work being done with PyMC. PyMC3 is a Python package for Bayesian statistical modeling and Probabilistic Machine Learning focusing on advanced Markov chain Monte Carlo (MCMC) and variational inference (VI) algorithms. Its flexibility and extensibility make it applicable to a large suite of problems.
Distilling knowledge from Neural Networks to build smaller and faster models
This article discusses GPT-2 and BERT models, as well using knowledge distillation to create highly accurate models with fewer parameters than their teachers
Using Bayesian Networks to forecast ancillary service volume in hospitals
A Python example using diagnostic input variables Photo from Unsplash, by EJ Strat Since I’ve been working with healthcare data (almost 10 years now), forecasting future…