Is there a difference between Data Engineering and Data Science?

Data Engineering is the preparation, manipulation, and design of data for further analysis.

Data Science is the practice of extracting and building knowledge from data; it combines computer science and statistics.

Common Data Formats:

  • CSV: (comma-separated values)
  • JSON: (JavaScript Object Notation)
  • HTML/XML: (HyperText Markup Language / Extensible Markup Language)
  • SQL and NoSQL databases

CSV is the easiest of these formats to work with and is extremely common.
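
To make the formats concrete, here is a minimal sketch that reads a small CSV table and re-expresses the same rows as JSON, using only Python's standard library. The sample data and the helper names (csv_to_records, records_to_json) are made up for illustration and are not part of these notes.

```python
import csv
import io
import json

# Hypothetical sample data: a header row followed by two records.
CSV_TEXT = "name,age\nAda,36\nGrace,45\n"


def csv_to_records(text):
    """Parse CSV text into a list of dicts keyed by the header row."""
    return list(csv.DictReader(io.StringIO(text)))


def records_to_json(records):
    """Serialize the parsed rows as a JSON array of objects."""
    return json.dumps(records)


records = csv_to_records(CSV_TEXT)
as_json = records_to_json(records)

print(records)
print(as_json)
```

Note that CSV carries no type information, so every value comes back as a string ("36", not 36). Converting columns to numbers or dates is one of the preparation steps that falls under data engineering.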
