1_LogisticRegressionNotes1
Contents
The range of response in linear regression
Logistic regression
  Logit transformation
  Log odds
  Logistic transformation
  Interpretation of parameters
  Prediction: male or female
  Model performance
Example in R
  The data
  Descriptive statistics and visualization
  Make logistic model
  Parameter interpretation
  Calculate log odds and fitted probability
  Get error rate
  Validation of model: training and testing
The range of response in linear regression

$$Y_i = \beta_0 + \beta_1 X_i + \epsilon_i$$

Now, if the predictor variable is not constrained, that is, if X can take any value from −∞ to +∞,
then the response Y also becomes unconstrained: the range of Y is the entire real number line.
Logistic regression
Suppose we are performing a linear regression taking a person's weight as the response and height as the predictor.
Here the response is a continuous numeric variable. But if the response is a binary variable that can take just
two values (for example, male vs female, bad vs good, pass vs fail, win vs loss, etc.), the scenario becomes a
little special and we need a special method to handle it.
In logistic regression, the response is a categorical variable with two levels. We code them with 1 and 0.
When we model it mathematically, we model the probability of one of the levels (for example, the probability of
being female, the probability of a win, the probability of a pass, etc.). When we model the probability of being
female, male is our reference level. Similarly, when we model the probability of a win, loss is our reference.

Suppose we have a male/female response. Our reference is "female" and our response is Y. In logistic
regression, we model the probability of being male, that is $P(Y = \text{male})$. Since Y can take either "male" or
"female", $P(Y = \text{male}) + P(Y = \text{female}) = 1$ must be satisfied.
Now let us attempt to model our response as we did in simple linear regression. Let us assume we
have one predictor X. We propose the following model:

$$P(Y_i = \text{male}) = \beta_0 + \beta_1 X_i + \epsilon_i$$
Now there is one problem with the above model. We are modeling a "probability" here, and a probability
cannot be less than zero or greater than 1. But the right-hand side can take any real value
(given that X is not constrained). So our proposed model is fundamentally wrong.
Let us try an example with simulated data. We have the heights and genders of 70 persons. Taking height
as a predictor, we will try to predict the gender of a person. We will code male and female as 1 and 0,
respectively.
set.seed(1)
heightmale=rnorm(40,178,4) ## male heights mean=178
set.seed(2)
heightfemale=rnorm(30,166,3) ## female heights mean=166
gender=c(rep(1, 40), rep(0, 30)) ## coded male=1, female=0
height=c(heightmale,heightfemale)
mydata=as.data.frame(cbind(height, gender))
cbind(head(mydata), tail(mydata))
Figure 1 shows the regression line for the model we proposed. We see that the regression line picks up the
overall trend: if X = height goes up, the probability of being male goes up; if X goes down, the probability of
being male goes down. But the line is not restricted to lie between 0 and 1, and since we are plotting a
probability along the y axis, this is not desirable. We would rather have something like Figure 2:
increasing X should increase the probability but never push it above 1, and decreasing X should decrease the
probability but never push it below zero.
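As a minimal sketch (assuming base R graphics and the simulated mydata above), the straight-line fit behind Figure 1 and an S-shaped curve like Figure 2 could be produced roughly as follows; the logistic fit used for the curve is introduced formally in the next section.

## straight-line fit of the 0/1 response (the model behind Figure 1)
linMod=lm(gender~height, data=mydata)
plot(mydata$height, mydata$gender, xlab="height", ylab="P(Y=male)")
abline(linMod)  ## the fitted line is not confined to [0, 1]

## S-shaped curve as in Figure 2, from a logistic fit (see next section)
logMod=glm(gender~height, data=mydata, family=binomial)
hgrid=seq(min(mydata$height), max(mydata$height), length.out=200)
lines(hgrid, predict(logMod, newdata=data.frame(height=hgrid), type="response"), lty=2)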
Logit transformation
Now the question is, how do we get this S-shaped curve? The answer is: by a transformation. We transform
the response so that it is relaxed to the entire real number line, and then we can model it by
$\beta_0 + \beta_1 X + \epsilon$. For logistic regression, our response is a probability, which lies in the interval [0, 1].
So we want a function that maps this interval to the real line.

Such a function is the logit function, defined as follows:

$$\text{logit}(P) = \log\left(\frac{P}{1-P}\right)$$
[Figure 1: P(Y=male) = b0 + b1*height — the straight-line fit of P(Y=male) against height, with male and female points marked; the line runs outside the 0–1 range.]

[Figure 2: the desired S-shaped curve of P(Y=male) against height, bounded between 0 and 1.]
The logit function has a domain of (0, 1) and a range of the entire real line. Since $\log\left(\frac{P}{1-P}\right)$ can take any real
value, we can actually model it as $\beta_0 + \beta_1 X + \epsilon$.
Log odds

For a probability P, the quantity P/(1 − P) is called the odds, and $\log\left(\frac{P}{1-P}\right)$ is called the log odds. The
logit of a probability is therefore just its log odds.
Logistic transformation
Now, with the logit transformation, we have found a way to model a binary response. We can write the model for
our gender-height data as follows:

$$\log\left(\frac{P}{1-P}\right) = \beta_0 + \beta_1 X + \epsilon$$

where P represents the probability of being male. We can estimate the regression parameters by the maximum
likelihood method. Suppose the estimated parameters are $b_0$ and $b_1$. We have

$$\log\left(\frac{\hat{P}}{1-\hat{P}}\right) = b_0 + b_1 X$$
Now we have the estimated log odds. But we are interested in the actual probability values rather than the log
odds. The fitted probabilities can be written as:

$$\hat{P} = \frac{1}{1 + \exp\{-(b_0 + b_1 X)\}}$$

Now the term $b_0 + b_1 X$ can be any real number, but $\hat{P}$ is capped between 0 and 1. This transformation,
which maps the entire real number line into the interval between 0 and 1, is called the logistic transformation:

$$\text{logistic}(x) = \frac{1}{1 + \exp(-x)}$$
Note: it is easy to see that the logistic function is the inverse of the logit function, and vice versa.
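A quick numerical check of this in R (a minimal sketch; the function names logit and logistic are defined here just for illustration):

logit=function(p) log(p/(1-p))
logistic=function(x) 1/(1+exp(-x))
logistic(logit(0.73))  ## returns 0.73: logistic undoes logit
logit(logistic(-1.2))  ## returns -1.2: logit undoes logistic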
Interpretation of parameters
$$\log\left(\frac{\hat{P}}{1-\hat{P}}\right) = b_0 + b_1 X$$
b0 interpretation
b0 is the estimated log odds of being male when X = 0; equivalently, the fitted probability of being male at
X = 0 is logistic(b0) = 1/(1 + exp(−b0)).
b1 interpretation
$$\log\left(\frac{P_1}{1-P_1}\right) = b_0 + b_1 X_1$$

$$\log\left(\frac{P_2}{1-P_2}\right) = b_0 + b_1 (X_1 + 1)$$

From the above two,

$$\log\left(\frac{P_2}{1-P_2}\right) - \log\left(\frac{P_1}{1-P_1}\right) = \log\left(\frac{P_2/(1-P_2)}{P_1/(1-P_1)}\right) = b_1$$

$$\implies \frac{P_2/(1-P_2)}{P_1/(1-P_1)} = \exp(b_1)$$
The left-hand side above, the ratio of two odds, is called the odds ratio (OR). If b1 > 0 then OR > 1, and if b1 < 0
then OR < 1. So the interpretation of b1 is best understood by describing the two scenarios separately (a small
numeric check follows the list):
Interpretation
• if b1 > 0, then with a unit increase in X, the odds of being male increase by (exp(b1) − 1) × 100%
• if b1 < 0, then with a unit increase in X, the odds of being male decrease by (1 − exp(b1)) × 100%
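For example (a purely hypothetical slope, not taken from any model in these notes):

b1=0.35              ## hypothetical slope estimate
exp(b1)              ## odds ratio, about 1.42
(exp(b1)-1)*100      ## the odds increase by about 42% per unit increase in X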
Prediction: male or female

From the logistic model above, we get the probability of each individual being male. But how can we
predict the categories? Probably the simplest way is to predict everybody with a probability less than 0.5
as female and everybody with a probability greater than or equal to 0.5 as male. This particular value
(here 0.5), where we draw the margin, is called the threshold or cutoff.
Note: although 0.5 seems a natural and intuitive choice of cutoff, it does not always give us
the best result. We will see that later.
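As a tiny illustration of the rule (with hypothetical fitted probabilities):

p_hat=c(0.12, 0.48, 0.51, 0.93)          ## hypothetical fitted probabilities
ifelse(p_hat>=0.5, "male", "female")     ## "female" "female" "male" "male"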
Model performance
A simple metric to assess the performance of a logistic model is the error rate: the number of wrong
predictions divided by the total number of predictions made. Sometimes we report the accuracy rate instead
(1 − error rate). Later in the course, we will see that there are other metrics for assessing logistic model
performance.
Example in R
The data
We will work with cheese data for making a logistic model. The data is from an Australian study of cheddar
cheese. Samples of cheese were analyzed for their chemical composition and were subjected to taste tests.
The data has two continuous and two categorical variables:
• Continuous variables:
– AceticConc: Concentration of acetic acid in log scale
– H2SConc: Concentration of hydrogen sulphide in log scale
• Categorical variables:
– LacticConc: Lactic acid concentration, low or high
– TasteScore: A binary score from the taste test, coded with 0 (bad) and 1 (good). This is our response
variable.
cdata=read.csv("cheese.csv")
head(cdata)
Descriptive statistics and visualization

Below we produce some descriptive statistics for the data. Our response is TasteScore. For the two groups of
TasteScore, we compute the means of AceticConc and H2SConc. They look different, but from these statistics
alone we cannot make any inference unless we also know the variability of the data. We will do that shortly.
The predictor LacticConc is categorical, and our response is also categorical, so a pivot table seems the logical
summary. We produce the pivot table for these two variables and see that the proportion of good cheeses looks
different in the two groups of LacticConc.
library(plyr)
ddply(cdata,~TasteScore,
      summarise,MeanAct=mean(AceticConc),
      MeanHS=mean(H2SConc) )

## pivot table of TasteScore (rows) against LacticConc (columns)
table(cdata$TasteScore, cdata$LacticConc)
##
##      high low
##    0    3  11
##    1   13   3
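To see those proportions explicitly, prop.table() can be applied to the same cross-tabulation (a quick optional check):

## proportion of bad/good within each LacticConc group (column-wise proportions)
prop.table(table(cdata$TasteScore, cdata$LacticConc), margin=2)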
Now we will produce boxplots and density plots of the two numeric variables.
Figure 3 shows these plots. For better visualization, we produced a separate panel for each group of the target.
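A minimal sketch of how such panels could be produced with ggplot2 (showing only the hydrogen sulphide panels; the acetic acid panels follow the same pattern, and the exact theme and layout of Figure 3 are assumptions):

library(ggplot2)

## boxplot of hydrogen sulphide concentration by taste group
ggplot(cdata, aes(x=factor(TasteScore), y=H2SConc, fill=factor(TasteScore))) +
  geom_boxplot() +
  labs(x="TasteScore", y="Hydrogen sulphide concentration")

## density plot of the same variable, one curve per taste group
ggplot(cdata, aes(x=H2SConc, fill=factor(TasteScore))) +
  geom_density(alpha=0.5) +
  labs(x="Hydrogen sulphide concentration")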
[Figure 3: boxplots (top) and density plots (bottom) of acetic acid concentration and hydrogen sulphide concentration, each split by factor(TasteScore) = 0 or 1.]
For acetic acid concentration, the boxplot shows a pronounced difference between the distributions of the two
target groups, but the density plot shows that there is a lot of overlap between them. The shift between the two
groups' distributions is more pronounced for hydrogen sulphide concentration: the boxes do not overlap at all,
and the overlap of the densities is also smaller than for the other variable.
Make logistic model

Let us make a logistic model for TasteScore with all other variables as predictors.
logModel1=glm(TasteScore~AceticConc+H2SConc+LacticConc, data=cdata,
family=binomial)
summary(logModel1)
##
## Call:
## glm(formula = TasteScore ~ AceticConc + H2SConc + LacticConc,
## family = binomial, data = cdata)
##
## Deviance Residuals:
## Min 1Q Median 3Q Max
## -2.03791 -0.30484 0.06527 0.39228 2.01177
##
## Coefficients:
## Estimate Std. Error z value Pr(>|z|)
## (Intercept) -4.4911 7.9704 -0.563 0.5731
## AceticConc -0.1619 1.6254 -0.100 0.9207
## H2SConc 1.1796 0.5287 2.231 0.0257 *
## LacticConclow -2.3729 1.2851 -1.846 0.0648 .
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## (Dispersion parameter for binomial family taken to be 1)
##
## Null deviance: 41.455 on 29 degrees of freedom
## Residual deviance: 17.750 on 26 degrees of freedom
## AIC: 25.75
##
## Number of Fisher Scoring iterations: 6
We used the glm function to fit the logistic model. The glm function (short for generalized linear model)
can be used for many types of regression with an appropriate family and link. The default link for the binomial
family is the logit link. So, the model above models the log odds of taste = good as a linear combination of
the predictors.
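If we want to be explicit about the link, the same model can be requested as follows (a sketch equivalent to logModel1 above, since the logit link is the binomial default; the name logModel1b is just for illustration):

logModel1b=glm(TasteScore~AceticConc+H2SConc+LacticConc, data=cdata,
               family=binomial(link="logit"))  ## identical fit to logModel1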
We see that acetic acid concentration does not have statistically significant power to predict the
taste of cheddar cheese: its p-value (0.92) is far above even the most lenient conventional significance
level of 0.1. So we drop this variable and fit a model with the other two predictors.
logModel2=glm(TasteScore~H2SConc+LacticConc, data=cdata,
family=binomial)
summary(logModel2)
##
## Call:
## glm(formula = TasteScore ~ H2SConc + LacticConc, family = binomial,
## data = cdata)
##
## Deviance Residuals:
## Min 1Q Median 3Q Max
## -2.0083 -0.2999 0.0658 0.3788 2.0157
##
## Coefficients:
## Estimate Std. Error z value Pr(>|z|)
## (Intercept) -5.2497 2.5031 -2.097 0.0360 *
## H2SConc 1.1538 0.4541 2.541 0.0111 *
## LacticConclow -2.3432 1.2472 -1.879 0.0603 .
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## (Dispersion parameter for binomial family taken to be 1)
##
## Null deviance: 41.455 on 29 degrees of freedom
## Residual deviance: 17.760 on 27 degrees of freedom
## AIC: 23.76
##
## Number of Fisher Scoring iterations: 6
Both variables now seem to have some predictive power. The H2SConc variable is statistically significant at
the 5% level (p-value less than 0.05) and LacticConclow is statistically significant at the 10% level.
Parameter interpretation
$b_{H2SConc} = 1.15$ and exp(1.15) = 3.16. So, with a unit increase in hydrogen sulphide concentration, the odds
of the cheese tasting good increase by 216%.
$b_{LacticConclow} = -2.34$. Now LacticConc is a categorical variable with two levels. Since we see the estimate
for the low level, the other level, high, was taken as the reference. exp(−2.34) = 0.096. So the odds of a cheese
being good with low lactic acid concentration are only 9.6% of the odds of a cheese being good with high lactic
acid concentration.
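These numbers can be pulled straight from the fitted model: exponentiating the coefficients of logModel2 gives the odds ratios (a quick check).

exp(coef(logModel2))  ## odds ratios: about 3.17 for H2SConc, about 0.096 for LacticConclow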
One more thing. By default, R took high as the reference for the LacticConc variable. When R converts a
character variable to a factor, it orders the levels alphabetically and uses the first level as the reference;
here "high" comes before "low", so "high" became the reference. It is, however, possible to change the reference.
In logModel3 below, we take low as the reference for LacticConc by releveling the factor before fitting.
cdata$LacticConc=relevel(factor(cdata$LacticConc), ref="low")  ## set "low" as reference level
logModel3=glm(TasteScore~H2SConc+LacticConc, data=cdata,
              family=binomial)
summary(logModel3)
##
## Call:
## glm(formula = TasteScore ~ H2SConc + LacticConc, family = binomial,
## data = cdata)
##
## Deviance Residuals:
## Min 1Q Median 3Q Max
## -2.0083 -0.2999 0.0658 0.3788 2.0157
##
## Coefficients:
## Estimate Std. Error z value Pr(>|z|)
## (Intercept) -7.5928 2.7537 -2.757 0.00583 **
## H2SConc 1.1538 0.4541 2.541 0.01107 *
## LacticConchigh 2.3432 1.2472 1.879 0.06029 .
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## (Dispersion parameter for binomial family taken to be 1)
##
## Null deviance: 41.455 on 29 degrees of freedom
## Residual deviance: 17.760 on 27 degrees of freedom
## AIC: 23.76
##
## Number of Fisher Scoring iterations: 6
We see that changing the reference changes the sign of the parameter. Now we have $b_{LacticConchigh} = 2.34$ and
exp(2.34) = 10.38. So, the odds of being good with high lactic acid concentration are 10.38 times the odds of
being good with low concentration. And of course, 1/0.09632764 = 10.38.
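The reciprocal relationship between the two parameterizations can be verified directly from the two fitted models (a quick sketch):

exp(coef(logModel3)["LacticConchigh"])       ## about 10.4
1/exp(coef(logModel2)["LacticConclow"])      ## the same value, from the other parameterization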
Calculate log odds and fitted probability

Let us work with the last model, where we took low as the reference for LacticConc.
Now we will calculate the log odds of being good. We will also calculate the probabilities of being good by
applying the logistic transformation to the log odds.
log_odds=predict(logModel3, newdata = cdata)
## check few of them
log_odds[1:5]
## 1 2 3 4 5
## -3.9757360 0.5688275 1.0245709 3.3990514 -3.2003954
## calculate probabilities by logistic transformation
prob_being_good=1/(1+exp(-log_odds))
## check first 5
prob_being_good[1:5]
## 1 2 3 4 5
## 0.01841983 0.63849259 0.73586199 0.96767488 0.03915084
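Alternatively, predict() can return the fitted probabilities directly with type = "response"; this should agree with the manual logistic transformation above (a quick check):

prob_direct=predict(logModel3, newdata = cdata, type = "response")
all.equal(prob_direct, prob_being_good)  ## TRUE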
Get error rate

As mentioned earlier, the error rate is one way to assess the performance of a logistic model. But first we need
to classify the predictions as "good" or "bad". Let us use 0.5 as our cutoff. Any value less than 0.5 will be
classified as bad (0), and the others will be classified as good (1).
cutoff=0.5
predicted_class=ifelse(prob_being_good<cutoff, 0, 1)
original_class=cdata$TasteScore
## make a confusion/contingency matrix
con_mat=table(original_class, predicted_class)
con_mat
##                predicted_class
## original_class  0  1
##              0 11  3
##              1  2 14
From the contingency table, we see that 5 out of 30 were predicted wrong. So the error rate here is 5/30 = 16.67%.
Note: as stated earlier, the error rate may change as we change the cutoff.
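The same number can be computed directly from the confusion matrix (a small sketch; the correct predictions sit on the diagonal):

error_rate=1-sum(diag(con_mat))/sum(con_mat)
error_rate  ## 0.1667, i.e. 5/30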
Validation of model: training and testing

To validate a model, we almost always hold out a portion of the data. We do not touch this portion while
building the model.
Let us save the last 10 observations of our data for validation purposes.
train=cdata[c(1:20),] # training data
test=cdata[c(21:30),] # test data
model=glm(TasteScore~H2SConc+LacticConc, data=train,
          family=binomial) # fit a logistic model on the training data
pred_log_odd_test=predict(model, newdata = test) ## log odds
pred_prob_test=1/(1+exp(-pred_log_odd_test)) ## fitted probabilities
pred_class_test=ifelse(pred_prob_test<0.5, 0, 1) ## classify with the 0.5 cutoff
original_class_test=test$TasteScore
table(original_class_test, pred_class_test) ## confusion matrix on the test data
##                    pred_class_test
## original_class_test 0 1
##                   0 5 2
##                   1 0 3
So, in the test data set, we got 2 misclassified out of 10, with an error rate of 20%.