Analytics Dispatch Archive

A weekly email about data, data science, and analytics. Curated by the team at Mode.

Artwork personalization at Netflix

Chromebook data science. Shaky CDC data. An NLP history lesson.

October 8, 2018

Build your own deep learning computer

Citizen data scientists. ML for coders. Connections across America.

October 1, 2018

Visualizing uncertainty

Anatomy of an AI system. Reproducing ML projects. 5 public datasets to explore.

September 24, 2018

Retracing your steps in machine learning

Strata slides. A DS ethics checklist. Life expectancy in your neighborhood.

September 17, 2018

Who is Anonymous?

PhD considerations. SQL vs Python. Machine translators.

September 10, 2018

China reigns with data

California's wildfires. Integrating ML into a product. Twitter toxicity.

August 27, 2018

The patriarchy of pockets

SQL queries for Salesforce. The beauty of annotations. Capturing data evolution.

August 20, 2018

The hazards of A/B testing

BigQuery table clusters. Unpacking an NLP Twitter thread. First days on the job.

August 13, 2018

Know your blindspot

3 million troll tweets. The Holy Grail of email. Partitioning variation.

August 6, 2018

Political bubbles

IMDb analysis. Differentiable image parameterizations. ACL 2018 highlights.

July 30, 2018

Reinforcement learning's fundamental flaw

W. E. B. Du Bois' data viz. Machine learning glossary. Speedier R work.

July 23, 2018

Doing good data science

What do ML practitioners do? Ditching microservices. Feature-wise transformations.

July 16, 2018

Apple Maps is reborn

What 85-year-olds are up to. Red flags in interviews. 12 ggplot2 extensions.

July 9, 2018

LeBron's next pick

2018 data viz survey. Data engineering frameworks. ML papers with code.

July 2, 2018

The best Mario Kart character

Constrained optimization. Census oddities. Bias-variance tradeoff.

June 25, 2018

Predicting the World Cup

ML's future is tiny. One year as a data scientist. Problems with gender classification.

June 18, 2018

Remote vs non-remote workers

Shazam, for Congress. The future of data engineering. Trustworthy data analysis.

June 11, 2018

Is UTC enough?

Better training data. Rethinking academic data sharing. Volcanic history.

June 4, 2018

An abundance of Ns

purrr tutorial. LaCroix color palettes. The challenges of Smart Compose.

May 21, 2018

The structure of standup

Strategies for optimizing Python code. The NYC subway crisis. Russian Facebook ads.

May 14, 2018

Terrifying talks

Attracting top-notch candidates. Data violence. Linear vs log scale.

May 7, 2018

The taxonomy of food

Grubhub's seemingly impossible data problem. Qualitative before quantitative. R package dependencies.

April 30, 2018

Space junk

Mode Studio. A Shiny app for Fido. Streaming 100 billion analytics events.

April 23, 2018

The future is modular

Why data scientists should take a Hippocratic oath. Machine learning at Condé Nast. New viz tools.

April 16, 2018

CNN vs MSNBC vs Fox

Odes to notebooks. Overcoming objections. Lumpers and splitters.

April 9, 2018

The evolution of Stephen Curry

Academia → industry. Lessons from video games. TensorFlow sans setup.

April 2, 2018

You've got one (million) shot(s)

A massive NCAA data set. GANs + art. The benefits of blameless postmortems.

March 26, 2018

Ethical code

Stack Overflow's developer survey. Docker for deep learning. Data-driven unit testing.

March 19, 2018

How neural networks decide

SQL → Pandas. Prophet in Mode. Visualizing outliers.

March 12, 2018

300 years of data viz

Love for the star schema. 8 in-app analytics examples. “Wall time” semantics.

March 5, 2018

How long's the wait?

Down with pipeline debt. Malicious use of AI.

February 26, 2018

All you knead is love

Pythonic cookies. A data viz engineer definition. Manifesto for data practices.

February 19, 2018

Quad-tum mechanics

The Olympics. ML models you haven't built. Visualizing missing data.

February 12, 2018

Why recommendation engines fail

Awesome in-app analytics. Another data privacy mishap. DJ Patil rallies the troops.

February 5, 2018

The Oscars game plan

Window functions in Python & SQL. The future of pandas. Free resource roundup.

January 29, 2018

Imposter no longer

Data engineering for dummies. The mortality rate of JS frameworks. A new DS podcast.

January 22, 2018

Follow the tea trails

Graphics reporter Q&A. Automating front-end. Job hunting post mortem.

January 15, 2018

Where athletes come from

Selecting a cloud provider. Academia → industry. Early-stage analytics.

January 8, 2018

No free lunch

Junior DS roles. A literal gamechanger. ML technical debt.

January 2, 2018

Google Maps' Moat

The next Bechdel Test. PyCon proposal myths. A re:Invent recap.

December 26, 2017

Now in 3D!

Apache Airflow, explained. The 3rd dimension of customer success analysis. Molecule's custom reports.

December 18, 2017

Postgres is hip again

U.S. AI threatened. NIPS highlights. Median pitfalls.

December 11, 2017

Hadoop or laptop?

Netflix's A/B test alternative. Predicting palliative care. Building a deep learning library.

December 4, 2017

A measure of fairness

Ethics in practice. Data meta-metrics. Numerical optimization.

November 27, 2017

Tracking your Thanksgiving

Crossword heatmaps. Generative music. Tips for building a diverse data team.

November 20, 2017

Trophy data scientist

The 3-degree world. Causal inference. Changepoint analysis.

November 13, 2017

What's real?

Halloween episodes. Ethical responsibility. Word cloud designs.

November 6, 2017

Streaming Spotify data

4 data mistakes startups make. Interviewing data scientists. A massive font database.

October 30, 2017

Trust the process

NBA analytics. Power calculations. State of data journalism.

October 23, 2017

A dirty dozen

12 A/B test pitfalls to avoid. Taxis vs cabs vs the subway. Evaluating ETL tools.

October 16, 2017

The Nate Silver Effect

zulily's data platform. R for journalists. The NYC job search.

October 9, 2017

Something's rotten

Debunking studio exec claims. An ETL company's stack. What closes deals.

October 2, 2017

Problems with probability

Accelerating GeoPandas. New R community. Making analytics meaningful.

September 25, 2017

Data security for data scientists

Finding the best data jobs. Legos + text mining. Scalable machine learning.

September 18, 2017

Troubleshooting neural networks

10x data scientists. 30 years of hurricanes. Communicating uncertainty.

September 11, 2017

Improving the Zestimate

Language gaps. Packaging metrics. A Python cheat sheet.

September 4, 2017

The Data Trust Gap

Foursquare's location intelligence. Giving your first data science talk. 10 Python mistakes.

August 28, 2017

Exposing espionage

Optimizing for Burning Man. Choosing an ETL tool. Scaling with Python.

August 14, 2017

Poetic Python

Cargo cult data science. Millions of Intercom messages. Query optimization.

August 7, 2017

Facebook's AI factory

Predicting LTV at Airbnb. Technical debt in ML. What's difficult about histograms.

July 31, 2017

Machine learning at Apple

Gender representation in comics. Data systems. Designing enterprise tables.

July 24, 2017

Graphing Jane Austen's genius

Joy plots. How to spot a misleading graph. Marrying UX & ML.

July 17, 2017

Flying blind

Rise of the data PM. Augmented reality viz. New NYC boroughs.

July 10, 2017

Not Hotdog

Optimizing Reddit submissions. R at Microsoft. Blogging about data.

July 3, 2017

A tale of two axes

Millions of doodles. 2 years at Stack Overflow. Coding on the go.

June 26, 2017

Going vertical

3 stages of data infrastructure. 29 common Python errors. 200,000 Uber and Lyft trips.

June 19, 2017

Big data B.S.

Root cause analyses. How histograms work. Analytics at Athos.

June 12, 2017

Disney's algorithms

The MLB's new metric. How to hire a product analyst. The Paris Agreement.

June 5, 2017

Mr. Rogers' rainbow

Airbnb's Data University. 30 GBs of federal payroll records. The top DS software.

May 29, 2017

The Emoji States of America

Big news from Mode. The Hitchhiker's Guide to d3.js. Detecting overspend in AWS.

May 22, 2017

Counting the hours

Duolingo's language learning model. Etsy's development process. Instacart's strategy for building DS teams.

May 15, 2017

Airbnb's North Star

Winning marital arguments with R. 3 million Instacart orders. Dashboards that deliver.

May 8, 2017

100 billion events

Spotify's event delivery system. Craft beers and Python. Data viz vs UI.

May 1, 2017

Architecture of Giants

Machine learning flash cards. Teaching SQL. Analytics trends in 2017.

April 24, 2017

The Stats of the Furious

Statistics in D3. Proving yourself without a degree. More on interactive viz.

April 17, 2017

I saw the sine

Avoiding analytic rabbit holes. The Data Wheel of Death. Rebuilding an analytics stack.

April 10, 2017

Winning at Scrabble

ML for product managers. Analytics for startup founders. Scrabble analyses.

April 3, 2017

Subreddit math

Group-by from scratch. Corporate data viz. Test-driving Prophet.

March 27, 2017

Perl is dead

Switching programming languages. Data hackathons. Is interactive viz done for?

March 20, 2017

Open source burnout

A data GIF tutorial. DS on the Silicon Beach. Blind date data.

March 13, 2017

Testing time series

Hiring a data scientist. The future of Airflow. Advice for switching careers.

March 6, 2017

The Zero Bug

Predicting earthquake preparedness, partisan conflict, and feature engineering.

February 27, 2017

Closed data

Online DS courses, ranked. Critical data literacy. Unlearning descriptive statistics.

February 20, 2017

Remembering Hans Rosling

Spotting visualization lies. Data humanism. Encoding categorical values.

February 13, 2017

Finding fake news

Mode's stance on Trump. ML at Fitbit. The cleanest NYC restaurants.

February 6, 2017

The hottest year yet—again

Data science at Stitch Fix. ML videos. A data engineer's manifesto.

January 30, 2017

How stats lost their power

Redefining “AI.” Behind-the-scenes of sports analytics. Building a master data dictionary.

January 23, 2017

A freelancer's tale

Uber Movement. Q&A w/ Monica Rogati. Visual vocabulary.

January 16, 2017

Sack the coach

The NFL and causal inference. Generating poetry with Python. Classifiers from scratch.

January 9, 2017

The great TV divide

Mid-career pivots. TV fandoms. Rationality + empathy.

January 2, 2017

Bringing down the Empire

Star Wars casualties. The state of the DS job market. CAC calculations.

December 26, 2016

Guerrilla archive

#DataRefuge. A chat with DJ Patil. Analyzing Google trends.

December 19, 2016

Analyze the rainbow

Skittles debates. Time series analysis in Python. Ditching vanity metrics.

December 12, 2016

Look ma, no polls!

A data detective story. BitTorrent for professors. Seasonality in search engines.

December 5, 2016

The good ol' days

Rebuilding trust in analytics, data limitations, and a text analysis tutorial.

November 28, 2016

Data wrangling Westworld

Data skills we all need, election post mortems, and runner routes.

November 21, 2016

The non-election issue

UFO sightings data. 415 viz tools. The science of unpredictability in... science.

November 14, 2016

When charts lie

Why data projects fail. An AI speechwriter. The end of baseball's analytics war.

November 7, 2016

Double the Trump

Flash forecasting. The father of soccer analytics. A new viz technique.

October 31, 2016

The David Spade Index

The problem with North Star metrics, the secret to designing smart products, and the popular vote.

October 24, 2016

Data in the deep fryer

The impact of outliers, marathon performance, and why machine learning is like deep frying.

October 17, 2016

Studying The Simpsons

Nobel Prize winners, your typical farmers market, and The Simpsons side characters.

October 10, 2016

Data movie magic

The year's best data visualizations, fact checking the debate, and movie magic with data viz.

October 3, 2016

Polling the pollsters

Gender roles in Hollywood, stats for soccer fans, and four results from one election poll.

September 26, 2016

Income rising

Summary analysis, creativity in data viz, and the income increase.

September 19, 2016

The state of data engineering

Digital economists, swing states, and the art of asking good questions.

September 12, 2016

What is Bayesian, really?

The pros and cons of urban cycling, rebuilding a Graphics team, and the joys of dot plots.

September 5, 2016

In English, please

One color palette generator, 8 Python data cleaning libraries, and the fastest men in the world.

August 29, 2016

End the language war

Visualizing clickbait, counting conundrums, and the problem with the Rio pool.

August 22, 2016

Data journalists go for gold

Trump tweets, Olympic data viz, and tips for designing better tables.

August 15, 2016

Visualizing Slack

A Star Trek network viz, ethics for algorithms, and the Olympics.

August 8, 2016

Lies and statistics

Data viz developments, dodgy statistics, and genomics.

August 1, 2016

Pokémon pandemonium

Amazon reviews, Bayesian thinking explained visually, and dashboard design.

July 25, 2016

Half a decade of drought

Pop music genealogy, FiveThirtyEight's R workflow, and a series of stunning drought maps.

July 18, 2016

Icarus, Oedipus, Cinderella

Data mining story arcs, theories of everything, and the history of the infographic.

July 11, 2016

Brunch so hard

Feature engineering, cartogram challenges, and an analysis Leslie Knope would love.

July 4, 2016

Brexit breakdowns

Plus: Mode now supports Plotly, data science portfolios, and fantasy football.

June 27, 2016

The 'Hamilton' algorithm

Escaping Excel hell, real-time dashboards, and the Data Journalism Awards.

June 20, 2016

Reddit's favorite field of science

Pie chart research, a Python cheat sheet, and machine learning for sales.

June 13, 2016

The influence of 'In da Club'

50 years of pop music, hybrid intelligence, and HBR data viz advice.

June 6, 2016

Following in Apple’s footsteps

Tighter data security, Foursquare + Uber, and data anonymization best practices.

May 30, 2016

Calculus not required (plus big news from Mode!)

News on the Python front, 24 charting tools, and SF rental prices.

May 23, 2016

A tale of two types of journalism

Sales data, pandas video tutorials, and data science in healthcare.

May 16, 2016

Beyond 'The Touchscreen Generation'

Kalman filters, pandas tutorials, and why newsrooms should own their data.

May 9, 2016

Foursquare the prophet

The power of proprietary data, pirated papers, and a glorious data viz catalog.

May 2, 2016

Googling 'Game of Thrones'

12 data science methods and 1 big HBO show.

April 25, 2016

Kobe's last shot

Thumbtack's data stack, storytelling at Jawbone, and 15 data viz interviews.

April 18, 2016

Big Brother grew wings

FBI spy planes, measuring MRR, and the Hollywood gender gap.

April 11, 2016

Mathematics is coming

Game of Thrones, scaling the data science org, and conversion optimization.

April 4, 2016

Pity the pie chart

Lift analysis, genocidal chatbots, and the plight of pie charts.

March 28, 2016

Harmony

Moneyball for book publishers, CAC, and the engineer-analyst relationship.

March 21, 2016

Fuzzy numbers

Back-end analytics, help center metrics, and predicting police misconduct.

March 14, 2016

Pointers from Steph

Shattering NBA records, 2 million chess games, and statistically significant growth hacking.

March 7, 2016

Break out the bubbly

Punctuation in code, PM employee onboarding advice, and practical data science skills.

February 27, 2016

Do the math

If Facebook were a pollster, BuzzFeed analytics, and the virtues of keeping things simple.

February 21, 2016

A solar system of 'Tainted Love'

Interstellar cover songs, 10 TED talks, and the presidential primary.

February 14, 2016

Mind the map gap

Central Africa's dearth of data, alternatives to open data portals, and data viz empathy.

February 6, 2016

A new kind of parasite

Flint failings, research parasites, and Disney princess linguistics.

February 1, 2016

13 years after Moneyball

Sabermetrics, DAU, and holiday shopper retention.

January 25, 2016

Squirrels gone wild

Edtech analytics, Python prep, and Powerball.

January 18, 2016

Hamlet's social network

Missing ordinals, football analytics, deep-learning chips, and more.

January 11, 2016

A decade in the life

'Star Wars,' random projection, inviting dissent, and Nick Felton's final report.

January 4, 2016

Getting closer to 'Her'

A machine intelligence progress report, mesmerizing viz, insightful data science talks, and delivery analytics.

December 21, 2015

The fashionable side of data science

Best practices, Google's effect on the 2016 election, climate change, p-values, and data-related stocking stuffers.

December 14, 2015

Hello, World!

Smiles, agriculture, Airbnb's data release, and more.

December 7, 2015