*By Michaela Lang & Jigyasa Sakhuja, Data Scientists at NETZSCH Analyzing & Testing*

In the first article of our Big Data series, we introduced the term Big Data and outlined the benefits that data processing can bring to production and, in particular, to thermal analysis. In this article, we take a closer look at the term Data Science and present some of its common methods.

## Definition of Data Science

As the name suggests, Data Science is the science of extracting valuable information from data. The goal is to use this information to improve the quality and efficiency of a specific process, or to gain new insights from it. Data Science makes it possible to uncover correlations that are otherwise not easily recognized. The field draws on several areas of expertise. Besides mathematics, statistics and computer science, domain knowledge plays a very important role. In thermal analysis especially, the chemical and physical processes must be understood and interpreted correctly in order to choose the right analysis methods and to avoid drawing wrong conclusions from measured data sets. At *NETZSCH Analyzing & Testing*, all necessary areas of expertise are available, which enables the company to apply Data Science methods in the field of thermal analysis. In the next section, we present some of the data analysis methods used in Data Science.

## Data Analytic Techniques

With a large amount of high-quality data, a data scientist can start on the main task: turning the data set into valuable information. Once the data has been preprocessed, the analysis can begin. The following sections describe how to approach this challenge.

### Data Exploration
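A first look at a data set typically covers its structure, the spread of its values, and pairwise correlations. The following is a minimal sketch of such a first look in plain Python. The column names and all numbers are made up for illustration and are not real measurement data; `summarize` and `pearson` are hypothetical helper names.

```python
import statistics

# Made-up measurements for illustration only (not real measurement data).
data = {
    "heating_rate_K_min": [2.0, 5.0, 10.0, 15.0, 20.0],
    "peak_temp_C": [150.1, 153.9, 158.2, 160.8, 163.0],
    "sample_mass_mg": [10.2, 9.8, 10.1, 10.0, 9.9],
}


def summarize(values):
    """Return basic descriptive statistics for one column."""
    return {
        "count": len(values),
        "min": min(values),
        "max": max(values),
        "mean": statistics.mean(values),
        "stdev": statistics.stdev(values),
    }


def pearson(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    mx, my = statistics.mean(x), statistics.mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


# Structure and distribution: one summary per column.
summary = {name: summarize(col) for name, col in data.items()}

# First correlation check between two columns.
r_rate_temp = pearson(data["heating_rate_K_min"], data["peak_temp_C"])

for name, stats in summary.items():
    print(name, stats)
print("correlation heating rate vs. peak temperature:", round(r_rate_temp, 3))
```

In practice, a library such as pandas would usually take over these steps; the plain-Python version is used here only to keep the sketch free of dependencies.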

With Data Exploration, the goal is to gain a basic understanding of the data. The structure of the data is identified and the distribution of the values is examined. Data Exploration reveals first correlations in the data and helps determine which method is best suited for the analysis.

### Predictive Analysis

Predictive Analysis is a subset of Business Intelligence and Business Analytics. The data sets are evaluated for patterns in order to predict trends and future outcomes. Several methods can be used for Predictive Analysis. The following gives a short overview of some of them:

#### Machine Learning:

#### Linear / Non-linear Regression:
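As a minimal sketch of how a regression model can be used for prediction, the example below fits a straight line to a handful of points by ordinary least squares and then evaluates it at a new input. It covers only the linear case. The data values and the helper names `fit_line` and `predict` are assumptions made for illustration.

```python
import statistics

# Made-up training data for illustration only:
# heating rate (K/min) and the peak temperature (°C) observed at that rate.
heating_rate = [2.0, 5.0, 10.0, 15.0, 20.0]
peak_temp = [150.1, 153.9, 158.2, 160.8, 163.0]


def fit_line(x, y):
    """Ordinary least squares fit of y = intercept + slope * x."""
    mx, my = statistics.mean(x), statistics.mean(y)
    slope = (
        sum((a - mx) * (b - my) for a, b in zip(x, y))
        / sum((a - mx) ** 2 for a in x)
    )
    intercept = my - slope * mx
    return slope, intercept


slope, intercept = fit_line(heating_rate, peak_temp)


def predict(x):
    """Predict the output for a new input using the fitted line."""
    return intercept + slope * x


print("slope:", slope, "intercept:", intercept)
print("predicted peak temperature at 12 K/min:", predict(12.0))
```

A straight line is the simplest possible choice. If the relationship in the data is clearly curved, a non-linear model would be the better fit.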

#### Classification:

- Linear/Non-Linear Classification:

- Logistic Regression:
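Logistic regression can be sketched as follows: a weighted input is passed through the sigmoid function to obtain a probability between 0 and 1, and the weights are adjusted by gradient descent on the log-loss. The data, the learning rate, the number of epochs, and the helper names below are all assumptions chosen for illustration, not a recommended configuration.

```python
import math

# Made-up example: one feature x and a binary label (0 or 1).
xs = [0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0]
labels = [0, 0, 0, 0, 1, 1, 1, 1]


def sigmoid(z):
    """Map any real number to the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(xs, labels, learning_rate=0.1, epochs=5000):
    """Fit weight w and bias b by batch gradient descent on the log-loss."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        grad_w = 0.0
        grad_b = 0.0
        for x, y in zip(xs, labels):
            error = sigmoid(w * x + b) - y  # predicted probability minus label
            grad_w += error * x
            grad_b += error
        w -= learning_rate * grad_w / n
        b -= learning_rate * grad_b / n
    return w, b


w, b = fit_logistic(xs, labels)


def predict_proba(x):
    """Estimated probability that x belongs to class 1."""
    return sigmoid(w * x + b)


print("P(class 1 | x=1.0):", predict_proba(1.0))
print("P(class 1 | x=3.5):", predict_proba(3.5))
```

An input is then assigned to class 1 when the estimated probability exceeds a chosen threshold, commonly 0.5.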

### Prescriptive Analysis

The main focus of Prescriptive Analysis is to find the best solution for the current data scenario. Going beyond Predictive Analysis, it provides recommendations on how to use the predicted information to influence the future. The goal is to analyze which decisions must be made to achieve the predicted result or to prevent it. The best prerequisite for good data analysis is a close exchange between the data scientists and the specialist department where the data originates. With years of experience and knowledge in thermal analysis, NETZSCH can apply Data Science methods in its field of expertise.

## Preview

In the next article, we would like to introduce you to the world of Machine Learning and Artificial Intelligence. We will cover the basics and present a sample of Machine Learning methods. So stay curious about the next blog article in our Big Data series!

#### References:

- https://www.logility.com/blog/descriptive-predictive-and-prescriptive-analytics-explained/
- https://datasolut.com/was-ist-machine-learning/
- https://entwickler.de/online/development/predictive-analytics-praxis-tipps-579847089.html
- https://medium.com/ml-research-lab/chapter-4-knowledge-from-the-data-and-data-exploration-analysis-99a72379733
- https://medium.com/ml-research-lab/chapter-4-knowledge-from-the-data-and-data-exploration-analysis-99a734792733
- https://www.researchgate.net/post/What_is_the_difference_between_linear_and_nonlinear_classification_techniques
- https://towardsdatascience.com/logistic-regression-detailed-overview-46c4da4303bc