Open in app

Sign In

Write

Sign In

Conor Mack
Conor Mack

2K Followers

Home

About

Published in Towards Data Science

·Jul 10, 2021

Why Bagging Works

In this post I deep dive on bagging or bootstrap aggregating. The focus is on building intuition for the underlying mechanics so that you better understand why this technique is so powerful. …

Data Science

3 min read

Why Bagging Works
Why Bagging Works
Data Science

3 min read


Published in Towards Data Science

·May 19, 2021

Structured natural language processing with Pandas and spaCy

Accelerate analysis by bringing structure to unstructured data — Working with natural language data can often be challenging due to its lack of structure. Most data scientists, analysts and product managers are familiar with structured tables, consisting of rows and columns, but less familiar with unstructured documents, consisting of sentences and words. For this reason, knowing how to approach…

Data Science

6 min read

Structured natural language processing with Pandas and spaCy
Structured natural language processing with Pandas and spaCy
Data Science

6 min read


Published in Towards Data Science

·May 15, 2021

Tail events, why they matter and how to model them

How to model the unexpected and unlikely the Bayesian way. Rare events are by definition, well, rare. But, inevitably they do happen and when they do they have outsized consequences. 9/11 was a tail event. The financial crisis of 2007/08 was a tail event. Coronavirus was a tail event. Many…

Data Science

6 min read

Tail events, why they matter and how to model them
Tail events, why they matter and how to model them
Data Science

6 min read


Published in Towards Data Science

·Apr 24, 2021

Fantastic features and where to find them

A few years ago I developed a model to identify fraudulent transactions for an online two-sided marketplace. My initial model was based on characteristics of the transaction and its context. This model was quite good, but I wanted to make it better. I was already using a Gradient Boosted Tree…

Data Science

7 min read

Fantastic features and where to find them
Fantastic features and where to find them
Data Science

7 min read


Apr 23, 2021

How much are 100 views worth on Medium?

This post is a short addendum to my longer analysis on earnings on Medium. In that post I estimated that one hour of member reading time is worth around $1.62 per day. One of the findings from that analysis is that internal member reading time is a stronger predictor of…

Data Science

2 min read

How much is 100 views worth on Medium?
How much is 100 views worth on Medium?
Data Science

2 min read


Apr 22, 2021

How much can you make on Medium? (or, how much I didn’t make with 200K+ views)

I started writing on Medium in 2017, but only recently joined the partner program. My most popular post (link below) has accumulated 210K views and 95 hours of member reading time. Awesome. In total, I have earned $3.33 from it. Not so awesome. …

Writing

6 min read

Earnings on Medium: a statistical analysis (or, how much I didn’t earn with 200K+ views)
Earnings on Medium: a statistical analysis (or, how much I didn’t earn with 200K+ views)
Writing

6 min read


Published in Analytics Vidhya

·Apr 14, 2021

Webpage optimisation with multiarmed bandits

Choosing the best layout, imagery and text to present to users on your webpage is hard. But, it doesn’t have to be. In fact *you* don’t need to choose — you can let data do it for you. In this post, I’m going to share a simple algorithm — implemented…

Startup

7 min read

How to optimize a webpage for clickthrough with simple Python code
How to optimize a webpage for clickthrough with simple Python code
Startup

7 min read


Published in Towards Data Science

·Apr 13, 2021

How to create custom scikit-learn classification and regression models

Scikit learn is *the* go to package for standard machine learning models in Python. It not only provides most of the core algorithms that you would want to use in practice (i.e. GBMs, Random Forests, Logistic/Linear regression), but also provides a wide range of tranforms for feature preprocessing (e.g. Onehot…

Data Science

2 min read

How to create custom scikit-learn classification and regression models
How to create custom scikit-learn classification and regression models
Data Science

2 min read


Published in Towards Data Science

·Apr 11, 2021

Lesser known data science techniques you should add to your toolkit

Being an effective data scientist often means being able to identify the right solution for a particular problem. In this post, I want to discuss three techniques that have enabled me to solve tricky problems across multiple contexts, but that aren’t widely used. The three techniques are quantile regression, exponential…

Data Science

7 min read

Lesser known data science techniques you should add to your toolkit
Lesser known data science techniques you should add to your toolkit
Data Science

7 min read


Published in Towards Data Science

·Apr 4, 2021

How to use Pytorch as a general optimizer

Pytorch is really fun to work with and if you are looking for a framework to get started with neural networks I highly recommend it — see my short tutorial on how to get up and running with a basic neural net in Pytorch here. What many people don’t realise…

Python

4 min read

How to use Pytorch as a general optimizer
How to use Pytorch as a general optimizer
Python

4 min read

Conor Mack

Conor Mack

2K Followers

Data Scientist, Economist, Pragmatist.

Following
  • Ben Rogojan

    Ben Rogojan

  • Isaiah McCall

    Isaiah McCall

  • Sam Warain

    Sam Warain

  • Netflix Technology Blog

    Netflix Technology Blog

  • Rishabh Sharma

    Rishabh Sharma

Help

Status

Writers

Blog

Careers

Privacy

Terms

About

Text to speech