Data Wranglers

A lot of time is spent cleaning and preparing data before the actual data science can begin. The wrangling process typically covers these steps:

Discovery
Schema structuring for relational databases
Cleaning
Enriching – this step is key to having good features
Validation
Publishing

Much of this work happens during the ETL (extract, transform, load) or ELT (extract, load, transform) process.
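
To make the cleaning, enriching, and validation steps a little more concrete, here is a minimal sketch in plain Python. Everything in it is hypothetical and only for illustration: the sample records, the field names (id, name, signup, spend), the derived tenure_days feature, and the choice to treat a missing spend as 0.0 are all assumptions, not a prescribed method. A real pipeline would more likely do this in SQL or a dataframe library.

```python
from datetime import date

# Hypothetical raw records, as they might arrive from a source system.
RAW_ROWS = [
    {"id": "1", "name": "  Alice ", "signup": "2020-01-15", "spend": "120.50"},
    {"id": "2", "name": "BOB", "signup": "2021-07-03", "spend": ""},
    {"id": "2", "name": "BOB", "signup": "2021-07-03", "spend": ""},  # duplicate
    {"id": "3", "name": "", "signup": "not a date", "spend": "40"},   # bad date
]


def clean(rows):
    """Cleaning: normalize text, parse types, drop duplicates and unparseable rows."""
    seen_ids = set()
    cleaned = []
    for row in rows:
        try:
            record = {
                "id": int(row["id"]),
                "name": row["name"].strip().title(),
                "signup": date.fromisoformat(row["signup"]),
                # Assumption: a missing spend value means no spend.
                "spend": float(row["spend"]) if row["spend"] else 0.0,
            }
        except ValueError:
            continue  # skip rows whose fields cannot be parsed
        if record["id"] in seen_ids:
            continue  # skip rows whose id was already seen
        seen_ids.add(record["id"])
        cleaned.append(record)
    return cleaned


def enrich(rows, as_of):
    """Enriching: derive a feature (account age in days) from an existing column."""
    return [{**r, "tenure_days": (as_of - r["signup"]).days} for r in rows]


def validate(rows):
    """Validation: fail loudly if a basic expectation about the data is broken."""
    for r in rows:
        assert r["name"], f"row {r['id']} has an empty name"
        assert r["spend"] >= 0, f"row {r['id']} has negative spend"
        assert r["tenure_days"] >= 0, f"row {r['id']} signed up after the as-of date"
    return rows


wrangled = validate(enrich(clean(RAW_ROWS), as_of=date(2022, 1, 1)))
```

The duplicate row and the row with the unparseable date are dropped during cleaning, so only two records reach the enriching and validation steps. Keeping each step as its own function makes it easy to run them in a different order, which is essentially the difference between ETL and ELT.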
