Regression Analysis: How to Estimate the Relationship between a Dependent Variable and One or More Independent Variables

1. Introduction to Regression Analysis

Regression analysis is a powerful statistical technique used to model and understand the relationship between a dependent variable (also known as the response or outcome variable) and one or more independent variables (predictors or explanatory variables). It is widely employed in various fields, including economics, social sciences, engineering, and natural sciences. In this section, we delve into the fundamentals of regression analysis, exploring its concepts, assumptions, and practical applications.

1. Perspectives on Regression Analysis:

- Statistical Perspective:

- From a statistical viewpoint, regression analysis aims to estimate the parameters of a linear or nonlinear model that best fits the observed data. The goal is to find a mathematical relationship that explains the variability in the dependent variable based on the values of the independent variables.

- Linear regression assumes a linear relationship between the variables, while nonlinear regression allows for more complex functional forms.

- Statistical inference involves hypothesis testing, confidence intervals, and assessing goodness-of-fit metrics (e.g., R-squared).

- Econometric Perspective:

- In economics, regression analysis is a fundamental tool for understanding economic relationships. Researchers use it to quantify the impact of policy changes, market forces, and other factors on economic outcomes.

- Econometric models often incorporate lagged variables, instrumental variables, and fixed effects to account for endogeneity and omitted variable bias.

- Machine Learning Perspective:

- In the era of machine learning, regression techniques are essential for predictive modeling. Algorithms like ridge regression, lasso regression, and support vector regression extend classical linear regression.

- Regularization methods prevent overfitting by adding penalty terms to the regression coefficients.

- Ensemble methods (e.g., random forests, gradient boosting) combine multiple regression models to improve prediction accuracy.
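To make the regularization idea above concrete, here is a minimal sketch (assuming NumPy and scikit-learn; the data are synthetic and purely illustrative) comparing ordinary least squares with ridge regression and showing how the penalty shrinks the coefficients:

```python
# Minimal sketch: ordinary least squares vs. ridge regression on synthetic data,
# illustrating how the L2 penalty shrinks coefficients toward zero.
import numpy as np
from sklearn.linear_model import LinearRegression, Ridge

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))                 # 100 observations, 5 predictors
true_beta = np.array([3.0, 0.0, -2.0, 0.0, 1.0])
y = X @ true_beta + rng.normal(scale=2.0, size=100)

ols = LinearRegression().fit(X, y)
ridge = Ridge(alpha=10.0).fit(X, y)           # alpha controls the penalty strength

print("OLS coefficients:  ", ols.coef_.round(2))
print("Ridge coefficients:", ridge.coef_.round(2))   # shrunk toward zero
```

The larger the penalty parameter, the stronger the shrinkage; lasso (an L1 penalty) behaves similarly but can set some coefficients exactly to zero, which is why it is often used for variable selection.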

2. Types of Regression Models:

- Simple Linear Regression:

- Simple linear regression models the relationship between a single independent variable (e.g., advertising spending) and the dependent variable (e.g., sales).

- Example: Predicting house prices based on square footage.

- Multiple Linear Regression:

- Multiple linear regression extends the concept to multiple independent variables.

- Example: Predicting a student's GPA using study hours, attendance, and extracurricular activities.

- Polynomial Regression:

- Polynomial regression captures nonlinear relationships by including higher-order terms (e.g., quadratic or cubic terms).

- Example: Modeling the growth of a plant based on sunlight exposure.

- Logistic Regression:

- Logistic regression is used for binary classification problems (e.g., spam detection, disease diagnosis).

- It estimates the probability of an event (e.g., customer churn) based on predictor variables.

3. Assumptions and Diagnostics:

- Linearity: The relationship between the variables should be approximately linear.

- Independence: Residuals (errors) should be independent and identically distributed.

- Homoscedasticity: The variance of residuals should be constant across all levels of predictors.

- Normality: Residuals should follow a normal distribution.

- Outliers and Influential Observations: Detect and handle outliers that may unduly affect the regression results.

4. Example: Predicting House Prices:

Let's consider a real estate dataset with features like square footage, number of bedrooms, and neighborhood. We want to predict house prices based on these variables. A multiple linear regression model can be formulated as:

\[ \text{Price} = \beta_0 + \beta_1 \cdot \text{SqFt} + \beta_2 \cdot \text{Bedrooms} + \beta_3 \cdot \text{Neighborhood} + \epsilon \]

- Interpretation: For a one-unit increase in square footage, the price changes by \(\beta_1\) units, holding other variables constant.

- We assess model fit using R-squared, residual plots, and significance tests for coefficients.
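As a hedged sketch of how such a model might be estimated in practice (assuming pandas and statsmodels are available; the dataset and coefficient values below are synthetic and invented purely for illustration), note that the neighborhood variable is treated as categorical via dummy variables rather than as a single numeric coefficient:

```python
# Illustrative sketch only: fitting the house-price model above with statsmodels.
# The data are simulated; in practice you would load your own real-estate data.
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(42)
n = 200
df = pd.DataFrame({
    "SqFt": rng.uniform(800, 3000, n),
    "Bedrooms": rng.integers(1, 6, n),
    "Neighborhood": rng.choice(["A", "B", "C"], n),
})
df["Price"] = (50_000 + 120 * df["SqFt"] + 8_000 * df["Bedrooms"]
               + df["Neighborhood"].map({"A": 0, "B": 15_000, "C": 30_000})
               + rng.normal(0, 20_000, n))

# C(...) treats Neighborhood as a categorical predictor (dummy variables).
model = smf.ols("Price ~ SqFt + Bedrooms + C(Neighborhood)", data=df).fit()
print(model.summary())        # coefficients, t-tests, residual diagnostics
print(model.rsquared)         # proportion of variance explained
```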

In summary, regression analysis provides a versatile framework for understanding relationships, making predictions, and informing decision-making. Whether you're a statistician, economist, or machine learning practitioner, mastering regression techniques is essential for extracting valuable insights from data.

2. Understanding Dependent and Independent Variables

Understanding dependent and independent variables is crucial in regression analysis. These variables play a significant role in estimating the relationship between them. In this section, we will delve into the concept of dependent and independent variables, exploring their definitions and significance.

1. Definition of Dependent Variable:

The dependent variable, also known as the response variable, is the outcome or the variable of interest in a regression analysis. It is the variable that we seek to explain or predict based on the independent variables. For example, in a study analyzing the impact of study time on exam scores, the exam score would be the dependent variable.

2. Definition of Independent Variables:

Independent variables, also referred to as predictor variables, are the variables that are hypothesized to have an impact on the dependent variable. These variables are manipulated or controlled by the researcher to observe their effect on the outcome. In the previous example, the study time would be an independent variable.

3. Relationship between Dependent and Independent Variables:

The relationship between the dependent and independent variables is the core focus of regression analysis. It aims to quantify the extent to which changes in the independent variables affect the dependent variable. Regression models provide a mathematical representation of this relationship, allowing us to estimate the impact of each independent variable on the dependent variable.

4. Examples of Dependent and Independent Variables:

To illustrate the concept, let's consider a real-world example. Suppose we want to analyze the factors influencing housing prices. The dependent variable would be the housing price, while the independent variables could include factors like location, square footage, number of bedrooms, and proximity to amenities. By examining the relationship between these variables, we can determine which factors have a significant impact on housing prices.

5. Importance of Identifying Dependent and Independent Variables:

Accurately identifying the dependent and independent variables is crucial for conducting meaningful regression analysis: reversing or mislabeling them leads to a misspecified model and misleading conclusions.

3. Types of Regression Models

In the section on "Types of Regression Models" within the blog "Regression Analysis: How to Estimate the Relationship between a Dependent Variable and One or More Independent Variables," we delve into the various regression models used in statistical analysis. Regression models are powerful tools for understanding the relationship between a dependent variable and one or more independent variables.

1. Simple Linear Regression: This model assumes a linear relationship between the dependent variable and a single independent variable. It estimates the slope and intercept of the line that best fits the data.

2. Multiple Linear Regression: In this model, we consider multiple independent variables to predict the dependent variable. It estimates the coefficients for each independent variable, allowing us to assess their individual contributions to the dependent variable.

3. Polynomial Regression: Sometimes, the relationship between the variables is not linear. Polynomial regression allows for curved relationships by including higher-order terms (e.g., quadratic or cubic) in the model.

4. Logistic Regression: Unlike linear regression, logistic regression is used when the dependent variable is categorical. It estimates the probability of an event occurring based on the independent variables.

5. Ridge Regression: This model is used when there is multicollinearity among the independent variables. It adds a penalty term to the regression equation, reducing the impact of correlated variables.

6. Lasso Regression: Similar to ridge regression, lasso regression also handles multicollinearity. However, it not only reduces the impact of correlated variables but also performs variable selection by setting some coefficients to zero.

7. Elastic Net Regression: Elastic net regression combines the properties of ridge and lasso regression. It addresses multicollinearity and performs variable selection simultaneously.

8. Time Series Regression: Time series regression models the relationship between variables over time. It considers temporal dependencies and can be used to forecast future values.

These are just a few examples of regression models used in statistical analysis. Each model has its own assumptions, strengths, and limitations. By understanding the different types of regression models, analysts can choose the most appropriate one for their specific research questions and data.
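To ground the logistic case from the list above, here is a small sketch (scikit-learn, with synthetic churn data; the variable names and coefficients are invented for illustration) that estimates the probability of a binary event from two predictors:

```python
# Minimal sketch: logistic regression for a binary outcome (e.g., churn vs. no churn).
# Synthetic data; names and coefficients are illustrative only.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(1)
n = 500
tenure = rng.uniform(0, 60, n)                   # months as a customer
monthly_spend = rng.uniform(10, 120, n)
# Made-up relationship: short tenure and high spend raise churn risk.
logit = 1.0 - 0.05 * tenure + 0.02 * monthly_spend
churn = (rng.random(n) < 1 / (1 + np.exp(-logit))).astype(int)

X = np.column_stack([tenure, monthly_spend])
clf = LogisticRegression().fit(X, churn)

print(clf.coef_, clf.intercept_)                 # coefficients on the log-odds scale
print(clf.predict_proba([[6, 90]])[0, 1])        # estimated churn probability
```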

4. Simple Linear Regression

### Insights from Different Perspectives

1. The Geometric Viewpoint:

- Imagine a scatterplot where each point represents an observation. The x-axis corresponds to our independent variable, and the y-axis represents the dependent variable. Simple Linear Regression aims to find the best-fitting line through these points.

- The "best-fitting" line minimizes the sum of squared differences between the observed y-values and the corresponding predicted y-values (given by the regression line).

- This line serves as our predictive model, allowing us to estimate the dependent variable's value for any given independent variable value.

2. The Statistical Viewpoint:

- Simple Linear Regression assumes that the relationship between the variables can be expressed as:

$$y = \beta_0 + \beta_1x + \epsilon$$

- Here, \(y\) is the dependent variable, \(x\) is the independent variable, \(\beta_0\) is the y-intercept, \(\beta_1\) is the slope (regression coefficient), and \(\epsilon\) represents the error term (residuals).

- Our goal is to estimate the values of \(\beta_0\) and \(\beta_1\) based on the observed data.

- The least squares method is commonly used to find the optimal values of \(\beta_0\) and \(\beta_1\).

3. The Practical Application:

- Let's consider an example: Suppose we want to predict a student's final exam score (\(y\)) based on the number of hours they studied (\(x\)).

- We collect data from several students, recording their study hours and exam scores.

- Using Simple Linear Regression, we find the line that best fits the data points. This line provides us with a predictive equation: \(y = 50 + 5x\).

- For instance, if a student studied for 10 hours, we estimate their exam score as \(y = 50 + 5 \cdot 10 = 100\).

### In-Depth Exploration

1. Estimating the Coefficients:

- The coefficients \(\beta_0\) and \(\beta_1\) are estimated using the following formulas:

- \(\hat{\beta_1} = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{\sum_{i=1}^{n} (x_i - \bar{x})^2}\)

- \(\hat{\beta_0} = \bar{y} - \hat{\beta_1}\bar{x}\)

- Here, \(\bar{x}\) and \(\bar{y}\) represent the sample means of the independent and dependent variables, respectively.

2. Assessing Model Fit:

- We evaluate the goodness of fit using the coefficient of determination (\(R^2\)):

- \(R^2 = \frac{\text{explained variation}}{\text{total variation}}\)

- A higher \(R^2\) indicates a better fit.

- Additionally, we examine the residuals (observed - predicted values) to check for patterns or outliers.

3. Hypothesis Testing:

- We can test whether the slope \(\beta_1\) is significantly different from zero using a t-test.

- The null hypothesis is that \(\beta_1 = 0\), implying no linear relationship.

### Example:

Suppose we have the following data:

| Hours Studied (\(x\)) | Exam Score (\(y\)) |
| --- | --- |
| 2 | 60 |
| 4 | 75 |
| 6 | 85 |
| 8 | 95 |

Using Simple Linear Regression, we find:

- \(\hat{\beta_0} = 50\)

- \(\hat{\beta_1} = 5.75\)

Our predictive equation becomes: \(y = 50 + 5.75x\).

Now, if a student studies for 7 hours, their estimated exam score is \(y = 50 + 5.75 \cdot 7 = 90.25\).
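The same estimates can be verified in a few lines of code, a minimal NumPy sketch of the least squares formulas given earlier:

```python
# Sketch: reproducing the hand calculation above with NumPy.
import numpy as np

x = np.array([2, 4, 6, 8])      # hours studied
y = np.array([60, 75, 85, 95])  # exam scores

beta1 = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
beta0 = y.mean() - beta1 * x.mean()

print(beta0, beta1)             # 50.0, 5.75
print(beta0 + beta1 * 7)        # predicted score for 7 hours: 90.25
```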

Remember, Simple Linear Regression is just the beginning! As we explore more complex models, we unlock even greater insights into the relationships between variables.

5. Multiple Linear Regression

Multiple Linear Regression: Understanding and Applying the Multivariable Model

1. The Essence of Multiple Linear Regression:

- Definition: Multiple linear regression (MLR) extends simple linear regression by incorporating two or more independent variables. The model assumes that the dependent variable is a linear combination of these predictors plus a random error term.

- Equation: The general form of an MLR equation is:

$$Y = \beta_0 + \beta_1X_1 + \beta_2X_2 + \ldots + \beta_pX_p + \varepsilon$$

Where:

- $Y$ represents the dependent variable.

- $\beta_0$ is the intercept.

- $\beta_1, \beta_2, \ldots, \beta_p$ are the coefficients corresponding to each predictor ($X_1, X_2, \ldots, X_p$).

- $\varepsilon$ denotes the error term.

- Assumptions:

- Linearity: The relationship between predictors and the response is linear.

- Independence: Errors are independent.

- Homoscedasticity: Constant variance of errors.

- No perfect multicollinearity: Predictors are not perfectly correlated.

- Insight: MLR allows us to account for the joint effect of multiple predictors, capturing their combined impact on the outcome.

2. Interpreting Coefficients:

- Each coefficient ($\beta_i$) represents the change in the dependent variable associated with a one-unit increase in the corresponding predictor, holding other predictors constant.

- Example: Suppose we model house prices ($Y$) based on square footage ($X_1$) and number of bedrooms ($X_2$). If $\beta_1 = 100$ and $\beta_2 = 20$, it means that for every additional square foot, the price increases by $100$, and for each extra bedroom, the price increases by $20$ (assuming no other changes).

3. Hypothesis Testing and Significance:

- Null Hypothesis: $H_0: \beta_i = 0$ (No effect of predictor $X_i$).

- Alternative Hypothesis: $H_a: \beta_i \neq 0$ (Significant effect).

- Test Statistic: We use t-tests to assess whether coefficients are significantly different from zero.

- Insight: A significant coefficient implies that the corresponding predictor matters in explaining the variation in the dependent variable.

4. Model Fit and R-squared:

- R-squared ($R^2$): Measures the proportion of variance in the dependent variable explained by the model.

- Adjusted R-squared: Penalizes adding unnecessary predictors.

- Interpretation: Higher $R^2$ indicates better fit, but beware of overfitting.

5. Handling Collinearity:

- VIF (Variance Inflation Factor): Detects multicollinearity.

- Remedies: Remove redundant predictors or combine them (e.g., principal component analysis).

6. Assumptions Validation:

- Residual Analysis: Check residuals for normality, homoscedasticity, and independence.

- Cook's Distance: Identify influential observations.

7. Example:

- Suppose we want to predict a student's final exam score ($Y$) based on study hours ($X_1$) and previous test scores ($X_2$).

- Our MLR model:

$$\text{Score} = 50 + 5X_1 + 10X_2 + \varepsilon$$

- Interpretation: For each additional study hour, the expected score increases by 5 (holding previous scores constant).
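A hedged sketch of how a model like this could be fit and inspected with statsmodels (the data are simulated and the coefficient values are illustrative, not taken from the example above):

```python
# Sketch: fitting and inspecting a two-predictor exam-score model with statsmodels.
# Synthetic data; the "true" coefficients below are chosen only for illustration.
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(7)
n = 150
df = pd.DataFrame({
    "study_hours": rng.uniform(0, 10, n),
    "prev_score": rng.uniform(40, 95, n),
})
df["score"] = 50 + 5 * df["study_hours"] + 0.3 * df["prev_score"] + rng.normal(0, 5, n)

fit = smf.ols("score ~ study_hours + prev_score", data=df).fit()
print(fit.params)                      # estimated intercept and slopes
print(fit.pvalues)                     # t-test p-values for H0: beta_i = 0
print(fit.rsquared, fit.rsquared_adj)  # fit vs. fit adjusted for model size
```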

In summary, MLR empowers us to explore complex relationships, control for confounding factors, and make informed predictions. Remember that while MLR provides valuable insights, proper model validation and understanding of assumptions are crucial for robust results.

6. Assessing Model Fit and Statistical Significance

In this section, we will delve into the important topic of assessing model fit and statistical significance in regression analysis. It is crucial to evaluate the goodness of fit of a regression model to determine how well it captures the relationship between the dependent variable and the independent variables.

From a statistical perspective, there are several methods to assess model fit. One commonly used measure is the coefficient of determination (R-squared), which indicates the proportion of the variance in the dependent variable that is explained by the independent variables. A higher R-squared value suggests a better fit of the model.

Another approach is to examine the residuals, which are the differences between the observed values and the predicted values from the regression model. By analyzing the residuals, we can assess whether the model adequately captures the patterns and variability in the data. If the residuals exhibit a random pattern with no discernible trends, it indicates a good fit.

Additionally, hypothesis testing can be employed to determine the statistical significance of the regression coefficients. This involves testing whether the coefficients are significantly different from zero. The t-test and p-value are commonly used in hypothesis testing, where a low p-value indicates that the coefficient is statistically significant.
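For concreteness, here is a small sketch (NumPy and SciPy, on synthetic data) of that t-test computed directly from the least squares formulas; the null hypothesis is that the slope equals zero:

```python
# Sketch: computing the t-statistic and p-value for a simple-regression slope by hand.
import numpy as np
from scipy import stats

rng = np.random.default_rng(3)
n = 40
x = rng.uniform(0, 10, n)
y = 2.0 + 1.5 * x + rng.normal(0, 2.0, n)        # synthetic data, true slope 1.5

beta1 = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
beta0 = y.mean() - beta1 * x.mean()
residuals = y - (beta0 + beta1 * x)

sigma2 = np.sum(residuals ** 2) / (n - 2)        # residual variance estimate
se_beta1 = np.sqrt(sigma2 / np.sum((x - x.mean()) ** 2))
t_stat = beta1 / se_beta1
p_value = 2 * stats.t.sf(abs(t_stat), df=n - 2)  # two-sided p-value

print(t_stat, p_value)
```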

To provide a more comprehensive understanding, let's explore some key insights:

1. Multicollinearity: It is essential to check for multicollinearity among the independent variables. Multicollinearity occurs when there is a high correlation between two or more independent variables, which can lead to unstable and unreliable coefficient estimates.

2. Outliers: Outliers are data points that deviate significantly from the overall pattern. They can have a substantial impact on the regression model's fit and should be carefully examined. Outliers can be identified using various techniques, such as studentized residuals or leverage statistics.

3. Model Comparison: It is often beneficial to compare different models to determine the best fit. This can be done using criteria such as the Akaike Information Criterion (AIC) or the Bayesian Information Criterion (BIC), which balance model fit against complexity.

4. Interaction Effects: Interaction effects occur when the relationship between the dependent variable and an independent variable depends on the value of another independent variable. Including interaction terms in the regression model can improve its fit and capture these complex relationships.

Let's illustrate these concepts with an example. Suppose we are examining the relationship between a person's age, education level, and income. By assessing model fit and statistical significance, we can determine how well the regression model explains the variation in income based on age and education level, considering factors like multicollinearity, outliers, and interaction effects.
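Continuing the age/education/income example, here is a minimal sketch of model comparison via AIC and BIC (statsmodels, with simulated data and invented coefficients):

```python
# Sketch: comparing two candidate income models by AIC/BIC with statsmodels.
# Synthetic data; "age", "education", and "income" are illustrative column names.
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(11)
n = 300
df = pd.DataFrame({
    "age": rng.uniform(20, 65, n),
    "education": rng.uniform(8, 20, n),   # years of schooling
})
df["income"] = 5_000 + 400 * df["age"] + 2_000 * df["education"] + rng.normal(0, 8_000, n)

m1 = smf.ols("income ~ age", data=df).fit()
m2 = smf.ols("income ~ age + education", data=df).fit()

# Lower AIC/BIC is better; both reward fit but penalize extra parameters.
print(m1.aic, m1.bic)
print(m2.aic, m2.bic)
```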

Remember, assessing model fit and statistical significance is crucial in regression analysis to ensure the reliability and validity of the results. By employing these techniques and considering various insights, we can gain a deeper understanding of the relationship between the dependent and independent variables.

7. Interpreting Regression Coefficients

1. Magnitude and Significance:

- The first thing we notice about regression coefficients is their magnitude. A positive coefficient indicates that an increase in the corresponding independent variable leads to an increase in the dependent variable (and vice versa for a negative coefficient).

- But how do we determine if the coefficient is statistically significant? We look at the p-value associated with each coefficient. A low p-value (typically less than 0.05) suggests that the coefficient is significantly different from zero.

- Example: Suppose we're modeling house prices based on square footage. A positive coefficient of 100 means that, on average, each additional square foot increases the house price by $100. If the p-value is small, we can be confident in this relationship.

2. Unit Interpretation:

- Coefficients are expressed in the units of the dependent variable. For instance, if we're predicting salary (in dollars) based on years of experience, a coefficient of 5000 means that each additional year of experience corresponds to a $5000 increase in salary.

- It's essential to consider the scale of the variables. If we change the unit (e.g., from years to months), the coefficient will also change.

3. Interaction Terms:

- Sometimes, the impact of an independent variable depends on the value of another variable. Interaction terms capture these complex relationships.

- For instance, in a model predicting customer satisfaction, the effect of price on satisfaction might differ for different age groups. We can include an interaction term (e.g., "price × age group") to account for this.

4. Control Variables:

- Regression coefficients help us control for confounding factors. By including relevant independent variables, we isolate the effect of interest.

- Suppose we're studying the impact of advertising spending on sales. We should control for other factors like seasonality, competitor activity, and economic conditions.

5. Standardization:

- Standardized coefficients (beta coefficients) allow us to compare the relative importance of variables. They express the change in the dependent variable per standard deviation change in the independent variable.

- Example: If the coefficient for education level is 0.2 (standardized), it means that a one-standard-deviation increase in education corresponds to a 0.2 standard-deviation increase in the dependent variable.

6. Categorical Variables:

- When dealing with categorical predictors (e.g., gender, region), we use dummy variables. The coefficients represent the difference in the dependent variable between the reference category and each other category.

- Interpretation becomes more nuanced here. For instance, if we have a dummy variable for "male," the coefficient represents the average difference in the dependent variable between males and females.

7. Collinearity:

- High correlation between independent variables (multicollinearity) can lead to unstable coefficients. It becomes challenging to isolate the effect of each variable.

- Detecting collinearity involves examining variance inflation factors (VIFs). If VIFs are too high (usually above 5 or 10), we might need to drop or combine variables.
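As a sketch of the VIF check just described (statsmodels, on synthetic data where two predictors are deliberately correlated):

```python
# Sketch: checking multicollinearity with variance inflation factors (statsmodels).
import numpy as np
import pandas as pd
import statsmodels.api as sm
from statsmodels.stats.outliers_influence import variance_inflation_factor

rng = np.random.default_rng(5)
n = 200
x1 = rng.normal(size=n)
x2 = 0.9 * x1 + rng.normal(scale=0.3, size=n)   # deliberately correlated with x1
x3 = rng.normal(size=n)

X = sm.add_constant(pd.DataFrame({"x1": x1, "x2": x2, "x3": x3}))
vifs = {col: variance_inflation_factor(X.values, i) for i, col in enumerate(X.columns)}
print(vifs)   # VIFs well above ~5-10 for x1 and x2 suggest problematic collinearity
```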

Remember that interpreting regression coefficients is both an art and a science. Context matters, and domain knowledge guides our understanding. So, next time you encounter a regression output, dive into those coefficients—they hold valuable stories about the relationships between variables!

8. Handling Assumptions and Limitations

1. Assumptions in Regression Analysis:

- Linearity Assumption: One of the fundamental assumptions in linear regression is that the relationship between the dependent variable (DV) and independent variables (IVs) is linear. However, real-world data often deviates from perfect linearity. For instance, consider predicting housing prices based on square footage. While a linear model might suffice for small houses, larger properties may exhibit diminishing returns.

- Homoscedasticity Assumption: Homoscedasticity implies that the variance of the residuals (errors) remains constant across all levels of the IVs. Violations of this assumption leave the coefficient estimates unbiased but make the standard errors, and therefore hypothesis tests and confidence intervals, unreliable. Imagine modeling stock returns based on company size. If the variability of returns increases disproportionately for larger firms, our model assumptions are compromised.

- Independence Assumption: Regression assumes that residuals are independent of each other. In time-series data, this assumption is often violated due to autocorrelation. For instance, predicting monthly sales based on historical sales data introduces temporal dependencies.

- Normality Assumption: While normality isn't strictly required for large sample sizes (thanks to the Central Limit Theorem), it's still relevant. If residuals significantly deviate from normality, confidence intervals and hypothesis tests may be affected. Suppose we're modeling exam scores based on study hours. If the residuals exhibit skewness, our inferences could be misleading.

2. Handling Assumptions:

- Transformations: When linearity assumptions are violated, consider transformations like logarithmic or polynomial functions. For instance, if predicting crop yield based on fertilizer usage, a log transformation might stabilize the relationship.

- Robust Standard Errors: To address heteroscedasticity, use robust (heteroscedasticity-consistent) standard errors. These account for varying residual variances and provide more reliable confidence intervals.

- Residual Plots: Visualize residuals against predicted values. Detect patterns (e.g., U-shaped residuals) that hint at nonlinearity or heteroscedasticity.

- Bootstrapping: When normality assumptions are questionable, bootstrapping can provide robust confidence intervals. For instance, when modeling patient recovery time based on treatment dosage, bootstrapping can handle skewed residuals.

3. Limitations of Regression Analysis:

- Causality vs. Correlation: Regression identifies associations, but causality requires experimental design. Suppose we're studying the impact of exercise on weight loss. Regression can't prove that exercise causes weight loss; it only reveals an association.

- Multicollinearity: When IVs are highly correlated, multicollinearity occurs. It inflates standard errors and makes coefficient interpretation tricky. For example, predicting GDP growth using both the inflation rate and interest rates may lead to multicollinearity.

- Outliers: Regression is sensitive to outliers. A single extreme data point can significantly affect the model. Imagine predicting employee productivity based on work hours; an outlier (e.g., an employee working 24/7) can distort results.

4. Examples:

- Example 1: Suppose we're modeling customer satisfaction (DV) based on response time and product quality (IVs). Residual plots reveal heteroscedasticity. We apply robust standard errors to address this.

- Example 2: Predicting home energy consumption (DV) using temperature and humidity (IVs). Residuals exhibit skewness. We bootstrap to estimate confidence intervals.
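Echoing Example 2, here is a minimal sketch of a case-resampling bootstrap for a slope's confidence interval (NumPy only, with synthetic data and deliberately skewed errors; 2,000 resamples is an arbitrary illustrative choice):

```python
# Sketch: a simple case-resampling bootstrap for a regression slope.
import numpy as np

rng = np.random.default_rng(9)
n = 80
x = rng.uniform(0, 10, n)
y = 3.0 + 1.2 * x + rng.standard_exponential(n)   # deliberately skewed errors

def slope(x, y):
    return np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)

boot = []
for _ in range(2000):
    idx = rng.integers(0, n, n)          # resample observations with replacement
    boot.append(slope(x[idx], y[idx]))

lo, hi = np.percentile(boot, [2.5, 97.5])
print(slope(x, y), (lo, hi))             # point estimate and 95% bootstrap interval
```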

Remember, acknowledging assumptions and limitations enhances the rigor of our analyses. By embracing uncertainty, we pave the way for more robust and reliable regression models.

9. Conclusion and Further Applications

In this comprehensive exploration of regression analysis, we've delved into the intricacies of estimating relationships between dependent and independent variables. As we wrap up our journey, let's reflect on the key takeaways and consider the exciting possibilities for further applications.

1. Summarizing Insights:

- From a statistical perspective, regression analysis allows us to quantify the impact of one or more independent variables on a dependent variable. Whether we're predicting sales based on advertising expenditure or assessing the effect of education level on income, regression provides valuable insights.

- The coefficient of determination (R-squared) helps us understand how well our model explains the variability in the dependent variable. A high R-squared indicates a strong fit, while a low value suggests room for improvement.

- Heteroscedasticity, multicollinearity, and autocorrelation are common challenges in regression. Addressing these issues ensures robust results.

2. Practical Applications:

- Marketing: Imagine a company launching a new product. By analyzing historical data, they can use regression to estimate how changes in marketing spend (independent variable) affect sales (dependent variable). Armed with this knowledge, they can optimize their marketing budget.

- Economics: Economists often employ regression to study the impact of policy changes. For instance, they might examine how changes in interest rates (independent variable) influence GDP growth (dependent variable).

- Healthcare: Regression models help predict patient outcomes based on factors like age, BMI, and medical history. Researchers can identify risk factors for diseases and tailor interventions accordingly.

3. Predictive Power:

- Regression isn't limited to historical analysis. It's a powerful tool for prediction. Suppose we're building a credit scoring model. By considering variables like income, credit history, and debt-to-income ratio, we can estimate the likelihood of loan default.

- Linear regression, polynomial regression, and logistic regression are just a few variants that cater to different scenarios.

4. Beyond Linear Relationships:

- While linear regression assumes a linear relationship between variables, real-world phenomena often exhibit more complex patterns. Nonlinear regression models capture curvilinear trends.

- Interaction terms allow us to explore how the effect of one variable changes based on the value of another. For instance, the impact of education on income might differ for different age groups.

5. Caveats and Caution:

- Remember that correlation doesn't imply causation. A strong relationship between variables doesn't necessarily mean one causes the other.

- Be cautious when extrapolating beyond the range of observed data. Predictions may become unreliable.

- Always validate your model using cross-validation and assess its performance on unseen data.

6. Future Directions:

- Machine learning techniques like random forests, gradient boosting, and neural networks extend regression's capabilities. These models handle nonlinearities and interactions more effectively.

- Big data opens up exciting possibilities. With massive datasets, we can explore subtle relationships and build more accurate models.

- Bayesian regression incorporates prior knowledge and uncertainty, making it suitable for decision-making under uncertainty.

Regression analysis is a versatile tool that empowers researchers, analysts, and decision-makers. As we continue to refine our understanding and explore novel applications, we unlock new avenues for knowledge and progress.
