This is one of the most interesting questions I have ever come across. I have always wondered why no one seems to want to know the backstory of data science, so let me tell you its history.

The idea behind data science is more than 50 years old. Did you know that the kind of work we now call data science was first described by John Tukey in 1962, under the name data analysis?

Later, in a 1985 lecture given to the Chinese Academy of Sciences in Beijing, Professor C.F. Jeff Wu used the term “Data Science” as an alternative name for statistics. I think it is an apt term because most of the work in statistics is done with data.

Even before that, in 1974, Peter Naur had proposed data science as an alternative name for computer science.

Then, in 1992, a statistics symposium at the University of Montpellier II acknowledged the emergence of a new discipline, one that focused on data in its various forms and combined established statistical concepts and principles with computational data analysis.

In 1996, the International Federation of Classification Societies held the first conference to feature data science as a topic. The next year, in 1997, C.F. Jeff Wu, the professor who had used the term in his Beijing lecture, proposed renaming statistics to data science. His aim was to break the stereotype that statistics is much like accounting or is limited to describing data, as there is much more to statistics than that.

However, this proposal was not taken up, and in 1998 Hayashi Chikio suggested that data science is a new concept in its own right, one that focuses on three aspects:

Data design 
Data collection 
Data analysis

We all know that the 1990s saw an internet revolution around the world. Data was being created, used, and modified far more than ever before. Finding patterns in data sets, analysing those patterns and trends, and discovering new information gave birth to another term: data mining.

After all this, in the early 2000s, interest in the field grew: white papers were published and journals were launched. Universities offered their own definitions. Even now, the term is often considered a buzzword. The field has many branches and many subsets. But I guess we do not need to define the term precisely, as we are already doing a lot of practical work with data.

What do you think about this? 



