A lot of time is spent cleaning and preparing data before the data science itself can begin. The typical steps are:
Discovery
Schema Structuring for Relational DBs
Cleaning
Enriching – this step is key to having good features
Validation
Publishing
A lot of this occurs during the ETL (extract, transform, load) or ELT (extract, load, transform) process.
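As a rough illustration of how the structuring, cleaning, enriching and validation steps above might fit together, here is a minimal Python sketch using only the standard library. The records, the column names (id, name, signup, spend) and the function names are all hypothetical, made up for this example. Discovery is assumed to have happened already (we know what the columns mean), and the final assignment stands in for publishing.

```python
from datetime import date

# Hypothetical raw records, as they might arrive from an extract step.
raw_rows = [
    {"id": "1", "name": " Ada ", "signup": "2021-03-01", "spend": "120.50"},
    {"id": "2", "name": "Grace", "signup": "2021-07-15", "spend": ""},
    {"id": "2", "name": "Grace", "signup": "2021-07-15", "spend": ""},  # duplicate
    {"id": "3", "name": "", "signup": "2022-01-09", "spend": "80"},
]

def structure(row):
    """Structuring: cast raw strings to the types a relational schema would use."""
    return {
        "id": int(row["id"]),
        "name": row["name"].strip() or None,  # empty names become None
        "signup": date.fromisoformat(row["signup"]),
        "spend": float(row["spend"]) if row["spend"] else None,
    }

def clean(rows):
    """Cleaning: drop repeated ids and rows that have no name."""
    seen, out = set(), []
    for r in rows:
        if r["id"] in seen or r["name"] is None:
            continue
        seen.add(r["id"])
        out.append(r)
    return out

def enrich(row):
    """Enriching: derive a new feature (signup year) from an existing column."""
    return {**row, "signup_year": row["signup"].year}

def validate(rows):
    """Validation: fail loudly if basic expectations do not hold."""
    ids = [r["id"] for r in rows]
    assert len(ids) == len(set(ids)), "ids must be unique"
    assert all(r["spend"] is None or r["spend"] >= 0 for r in rows), \
        "spend must be non-negative"
    return rows

# Publishing (stand-in): the validated rows are what downstream users would get.
published = validate([enrich(r) for r in clean([structure(r) for r in raw_rows])])
```

In an ETL setup these functions would run before the data is loaded into the target store; in an ELT setup the raw rows would be loaded first and the same kind of logic applied afterwards, often in SQL.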