[Statistics] KKNR Ch.16 Selecting the Best Regression Equation.

미국유학/연구 2019. 5. 4. 10:05

The general problems to be discussed: we have one response variable Y and a set of k predictor variables X1, X2, X3...Xk; and we want to determine the best subset of the k predictors and the corresponding best-fitting regression model for describing the relationship between Y and the X's. What exactly we mean by "best" depends in part on our overall goal for modeling.

One goal is to find a model that provides the best prediction of Y, given X1,X2, ..., Xk, for some new observation or for a batch of new observations.

Alongside the question of prediction is the question of validity- that is, of obtaining accurate estimates for one or more regression coefficient parameters in a model and then making inferences about these parameters of interest. The goal here is to quantify the relationship between one or more independent variables of interest and the dependent variable, controlling when necessary for other variables.

1. Specify the maximum model (defined in Section 16.3) to be considered.

2. Specify a criterion for selecting a model.

3. Specify a strategy for selecting variables.

4. Conduct the specified analysis.

5. Evaluate the reliability of the model chosen.

By following these steps, one can convert the global goal of finding the best predictors of Y into simple, concrete actions. Each step helped to ensure reliability and to reduce the work required.

- Recall that overfitting a model (including variables in the model with truly zero regression coefficients in the population) will not introduce bias when population regression coefficients are estimated if the usual regression assumptions are met. We must be careful, however, to ensure that overfitting does not introduce harmful collinearity (ch. 14). Underfitting (i.e. leaving important predictors out of the final model), however, will introduce bias in the estimated regression coefficients.

저작자표시 비영리 변경금지 (새창열림)

'미국유학 > 연구' 카테고리의 다른 글

합의적 질적 연구 - Clara E. Hill (4) (0)	2019.10.20
합의적 질적 연구 - Clara E. Hill (3) (0)	2019.10.20
합의적 질적 연구 - Clara E. Hill (2) (0)	2019.10.20
합의적 질적 연구 - Clara E. Hill (1) (0)	2019.10.20
[Statistics] KKNR Ch.5 Determining the Best Straight Line (0)	2019.05.04

posted by sergeant

일	월	화	수	목	금	토
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31

유학, 책, 여행 - 기록

검색결과 리스트

글

[Statistics] KKNR Ch.16 Selecting the Best Regression Equation.

설정

트랙백

댓글

'미국유학 > 연구' 카테고리의 다른 글

CATEGORY

TAG

RECENT POSTS

RECENT COMMENT

NOTICE

MY LINK

ARCHIVE

calendar

검색

COUNTER

티스토리툴바