Univariate or multivariate regression for this project?





.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty{ margin-bottom:0;
}






up vote
3
down vote

favorite












I'm looking at the risk of seizure in patients with metastasizing brain cancer and so far have several variables that I want to check to my dependent variable of seizure yes/no.



These are variables such as age, sex, tumor size and much more.



Now if I understand it correctly, first I run a univariate regression and the result will tell me whether there is a statistical significant relationship between one variable and my dependent variable.



However, I see in similar studies that they also do multivariate regression, but do not specify how, exactly.



Can someone help me understand why I would do a multivariate regression in such a project?



Thank you.










share|cite|improve this question







New contributor




Paze is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
















  • 4




    Just a note on terminology: multivariate regressions mean you have multiple dependent variables. These can get complicated. Multiple regressions mean you have multiple IVs. It's quite straight forward as the answer below shows. Also, the advantage of using multiple IV's in the same model lets you get estimates for each IV controlling for the other ones, which is very beneficial.
    – Huy Pham
    Dec 2 at 17:16

















up vote
3
down vote

favorite












I'm looking at the risk of seizure in patients with metastasizing brain cancer and so far have several variables that I want to check to my dependent variable of seizure yes/no.



These are variables such as age, sex, tumor size and much more.



Now if I understand it correctly, first I run a univariate regression and the result will tell me whether there is a statistical significant relationship between one variable and my dependent variable.



However, I see in similar studies that they also do multivariate regression, but do not specify how, exactly.



Can someone help me understand why I would do a multivariate regression in such a project?



Thank you.










share|cite|improve this question







New contributor




Paze is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
















  • 4




    Just a note on terminology: multivariate regressions mean you have multiple dependent variables. These can get complicated. Multiple regressions mean you have multiple IVs. It's quite straight forward as the answer below shows. Also, the advantage of using multiple IV's in the same model lets you get estimates for each IV controlling for the other ones, which is very beneficial.
    – Huy Pham
    Dec 2 at 17:16













up vote
3
down vote

favorite









up vote
3
down vote

favorite











I'm looking at the risk of seizure in patients with metastasizing brain cancer and so far have several variables that I want to check to my dependent variable of seizure yes/no.



These are variables such as age, sex, tumor size and much more.



Now if I understand it correctly, first I run a univariate regression and the result will tell me whether there is a statistical significant relationship between one variable and my dependent variable.



However, I see in similar studies that they also do multivariate regression, but do not specify how, exactly.



Can someone help me understand why I would do a multivariate regression in such a project?



Thank you.










share|cite|improve this question







New contributor




Paze is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.











I'm looking at the risk of seizure in patients with metastasizing brain cancer and so far have several variables that I want to check to my dependent variable of seizure yes/no.



These are variables such as age, sex, tumor size and much more.



Now if I understand it correctly, first I run a univariate regression and the result will tell me whether there is a statistical significant relationship between one variable and my dependent variable.



However, I see in similar studies that they also do multivariate regression, but do not specify how, exactly.



Can someone help me understand why I would do a multivariate regression in such a project?



Thank you.







regression medicine






share|cite|improve this question







New contributor




Paze is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.











share|cite|improve this question







New contributor




Paze is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.









share|cite|improve this question




share|cite|improve this question






New contributor




Paze is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.









asked Dec 2 at 15:42









Paze

1354




1354




New contributor




Paze is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.





New contributor





Paze is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






Paze is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.








  • 4




    Just a note on terminology: multivariate regressions mean you have multiple dependent variables. These can get complicated. Multiple regressions mean you have multiple IVs. It's quite straight forward as the answer below shows. Also, the advantage of using multiple IV's in the same model lets you get estimates for each IV controlling for the other ones, which is very beneficial.
    – Huy Pham
    Dec 2 at 17:16














  • 4




    Just a note on terminology: multivariate regressions mean you have multiple dependent variables. These can get complicated. Multiple regressions mean you have multiple IVs. It's quite straight forward as the answer below shows. Also, the advantage of using multiple IV's in the same model lets you get estimates for each IV controlling for the other ones, which is very beneficial.
    – Huy Pham
    Dec 2 at 17:16








4




4




Just a note on terminology: multivariate regressions mean you have multiple dependent variables. These can get complicated. Multiple regressions mean you have multiple IVs. It's quite straight forward as the answer below shows. Also, the advantage of using multiple IV's in the same model lets you get estimates for each IV controlling for the other ones, which is very beneficial.
– Huy Pham
Dec 2 at 17:16




Just a note on terminology: multivariate regressions mean you have multiple dependent variables. These can get complicated. Multiple regressions mean you have multiple IVs. It's quite straight forward as the answer below shows. Also, the advantage of using multiple IV's in the same model lets you get estimates for each IV controlling for the other ones, which is very beneficial.
– Huy Pham
Dec 2 at 17:16










1 Answer
1






active

oldest

votes

















up vote
6
down vote



accepted










For the sake of simplicity, let's say your independent variables consist of age, sex and tumour size only.



When you fit a univariate binary logistic regression model relating your dependent variable (seizure, yes or no) to the independent variable age, the model is ultimately enabling you to answer this question:



How does age affect the probability of seizure in the target patient population (i.e., for all patients in the population, regardless of their sex and tumour size)?



When you fit a univariate binary logistic regression model relating your dependent variable (seizure, yes or no) to the independent variable sex, the model is ultimately enabling you to answer this question:



How does sex affect the probability of seizure in the target patient population (i.e., for all patients in the population, regardless of their age and tumour size)?



When you fit a univariate binary logistic regression model relating your dependent variable (seizure, yes or no) to the independent variable tumour size, the model is ultimately enabling you to answer this question:



How does tumour size affect the probability of seizure in the target patient population (i.e., for all patients in the population, regardless of their age and sex)?



When you fit a multiple binary logistic regression model relating your dependent variable (seizure, yes or no) to the independent variables age, sex and tumour size, the model is ultimately enabling you to answer more pointed questions (assuming you only include main effects for these independent variables in your model):




  1. How does age affect the probability of seizure for patients in the target patient population having the same sex and the same tumour size?


  2. How does sex affect the probability of seizure for patients in the target population having the same age and the same tumour size?


  3. How does tumour size affect the probability of seizure for patients in the target population having the same age and the same sex?



Of course, if you include interactions between any of the independent variables in your multiple binary logistic regression model, that expands the list of questions you can ask.






share|cite|improve this answer



















  • 2




    Thank you. That explains a lot. So when inputting the multivariate analysis in my statistical software (STATA), if I want to answer the question: "How does age affect the probability of seizure for patients in the target patient population having the same sex and the same tumour size)?" Do I select seizure, sex and tumor size as the dependent variables, and age as the independent variable?
    – Paze
    Dec 2 at 16:41






  • 1




    You're welcome! No, you only select seizure as the dependent variable and then include age, sex and tumour size together as the independent variable. The 3 questions I listed in my answers, 1., 2. and 3., can all be answered based on this one model! That's the beauty of multiple binary logistic regression!
    – Isabella Ghement
    Dec 2 at 16:44






  • 1




    Paze, Isabella provided you with some nice guidance, but there are many other factors you'll want to consider to ensure you understand what you are doing. Get a copy of "Applied Logistic Regression" by Hosmer, Lemenshow, and Sturdivant. You'll want to make sure your continuous variables are linear in the log of odds, you'll want to understand why a variable can be statistically significant in a simple regression but insignificant in multiple regression. The book does a great job of taking you from square 1 to proficiency in logistic regression. Statistics is very much garbage in garbage out.
    – ColorStatistics
    Dec 2 at 18:04






  • 1




    And thank you very much Isabella for your invaluable help!
    – Paze
    Dec 2 at 18:37






  • 1




    Grea advice, @ColorStatistics! The only thing I don't agree with is "Statistics is very much garbage in garbage out". I would qualify that statement as follows: "When applied thoughtlessly, statistics can very much be garbage in, garbage out." 😝
    – Isabella Ghement
    Dec 2 at 18:54











Your Answer





StackExchange.ifUsing("editor", function () {
return StackExchange.using("mathjaxEditing", function () {
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
});
});
}, "mathjax-editing");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "65"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});


}
});






Paze is a new contributor. Be nice, and check out our Code of Conduct.










draft saved

draft discarded


















StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f379924%2funivariate-or-multivariate-regression-for-this-project%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown

























1 Answer
1






active

oldest

votes








1 Answer
1






active

oldest

votes









active

oldest

votes






active

oldest

votes








up vote
6
down vote



accepted










For the sake of simplicity, let's say your independent variables consist of age, sex and tumour size only.



When you fit a univariate binary logistic regression model relating your dependent variable (seizure, yes or no) to the independent variable age, the model is ultimately enabling you to answer this question:



How does age affect the probability of seizure in the target patient population (i.e., for all patients in the population, regardless of their sex and tumour size)?



When you fit a univariate binary logistic regression model relating your dependent variable (seizure, yes or no) to the independent variable sex, the model is ultimately enabling you to answer this question:



How does sex affect the probability of seizure in the target patient population (i.e., for all patients in the population, regardless of their age and tumour size)?



When you fit a univariate binary logistic regression model relating your dependent variable (seizure, yes or no) to the independent variable tumour size, the model is ultimately enabling you to answer this question:



How does tumour size affect the probability of seizure in the target patient population (i.e., for all patients in the population, regardless of their age and sex)?



When you fit a multiple binary logistic regression model relating your dependent variable (seizure, yes or no) to the independent variables age, sex and tumour size, the model is ultimately enabling you to answer more pointed questions (assuming you only include main effects for these independent variables in your model):




  1. How does age affect the probability of seizure for patients in the target patient population having the same sex and the same tumour size?


  2. How does sex affect the probability of seizure for patients in the target population having the same age and the same tumour size?


  3. How does tumour size affect the probability of seizure for patients in the target population having the same age and the same sex?



Of course, if you include interactions between any of the independent variables in your multiple binary logistic regression model, that expands the list of questions you can ask.






share|cite|improve this answer



















  • 2




    Thank you. That explains a lot. So when inputting the multivariate analysis in my statistical software (STATA), if I want to answer the question: "How does age affect the probability of seizure for patients in the target patient population having the same sex and the same tumour size)?" Do I select seizure, sex and tumor size as the dependent variables, and age as the independent variable?
    – Paze
    Dec 2 at 16:41






  • 1




    You're welcome! No, you only select seizure as the dependent variable and then include age, sex and tumour size together as the independent variable. The 3 questions I listed in my answers, 1., 2. and 3., can all be answered based on this one model! That's the beauty of multiple binary logistic regression!
    – Isabella Ghement
    Dec 2 at 16:44






  • 1




    Paze, Isabella provided you with some nice guidance, but there are many other factors you'll want to consider to ensure you understand what you are doing. Get a copy of "Applied Logistic Regression" by Hosmer, Lemenshow, and Sturdivant. You'll want to make sure your continuous variables are linear in the log of odds, you'll want to understand why a variable can be statistically significant in a simple regression but insignificant in multiple regression. The book does a great job of taking you from square 1 to proficiency in logistic regression. Statistics is very much garbage in garbage out.
    – ColorStatistics
    Dec 2 at 18:04






  • 1




    And thank you very much Isabella for your invaluable help!
    – Paze
    Dec 2 at 18:37






  • 1




    Grea advice, @ColorStatistics! The only thing I don't agree with is "Statistics is very much garbage in garbage out". I would qualify that statement as follows: "When applied thoughtlessly, statistics can very much be garbage in, garbage out." 😝
    – Isabella Ghement
    Dec 2 at 18:54















up vote
6
down vote



accepted










For the sake of simplicity, let's say your independent variables consist of age, sex and tumour size only.



When you fit a univariate binary logistic regression model relating your dependent variable (seizure, yes or no) to the independent variable age, the model is ultimately enabling you to answer this question:



How does age affect the probability of seizure in the target patient population (i.e., for all patients in the population, regardless of their sex and tumour size)?



When you fit a univariate binary logistic regression model relating your dependent variable (seizure, yes or no) to the independent variable sex, the model is ultimately enabling you to answer this question:



How does sex affect the probability of seizure in the target patient population (i.e., for all patients in the population, regardless of their age and tumour size)?



When you fit a univariate binary logistic regression model relating your dependent variable (seizure, yes or no) to the independent variable tumour size, the model is ultimately enabling you to answer this question:



How does tumour size affect the probability of seizure in the target patient population (i.e., for all patients in the population, regardless of their age and sex)?



When you fit a multiple binary logistic regression model relating your dependent variable (seizure, yes or no) to the independent variables age, sex and tumour size, the model is ultimately enabling you to answer more pointed questions (assuming you only include main effects for these independent variables in your model):




  1. How does age affect the probability of seizure for patients in the target patient population having the same sex and the same tumour size?


  2. How does sex affect the probability of seizure for patients in the target population having the same age and the same tumour size?


  3. How does tumour size affect the probability of seizure for patients in the target population having the same age and the same sex?



Of course, if you include interactions between any of the independent variables in your multiple binary logistic regression model, that expands the list of questions you can ask.






share|cite|improve this answer



















  • 2




    Thank you. That explains a lot. So when inputting the multivariate analysis in my statistical software (STATA), if I want to answer the question: "How does age affect the probability of seizure for patients in the target patient population having the same sex and the same tumour size)?" Do I select seizure, sex and tumor size as the dependent variables, and age as the independent variable?
    – Paze
    Dec 2 at 16:41






  • 1




    You're welcome! No, you only select seizure as the dependent variable and then include age, sex and tumour size together as the independent variable. The 3 questions I listed in my answers, 1., 2. and 3., can all be answered based on this one model! That's the beauty of multiple binary logistic regression!
    – Isabella Ghement
    Dec 2 at 16:44






  • 1




    Paze, Isabella provided you with some nice guidance, but there are many other factors you'll want to consider to ensure you understand what you are doing. Get a copy of "Applied Logistic Regression" by Hosmer, Lemenshow, and Sturdivant. You'll want to make sure your continuous variables are linear in the log of odds, you'll want to understand why a variable can be statistically significant in a simple regression but insignificant in multiple regression. The book does a great job of taking you from square 1 to proficiency in logistic regression. Statistics is very much garbage in garbage out.
    – ColorStatistics
    Dec 2 at 18:04






  • 1




    And thank you very much Isabella for your invaluable help!
    – Paze
    Dec 2 at 18:37






  • 1




    Grea advice, @ColorStatistics! The only thing I don't agree with is "Statistics is very much garbage in garbage out". I would qualify that statement as follows: "When applied thoughtlessly, statistics can very much be garbage in, garbage out." 😝
    – Isabella Ghement
    Dec 2 at 18:54













up vote
6
down vote



accepted







up vote
6
down vote



accepted






For the sake of simplicity, let's say your independent variables consist of age, sex and tumour size only.



When you fit a univariate binary logistic regression model relating your dependent variable (seizure, yes or no) to the independent variable age, the model is ultimately enabling you to answer this question:



How does age affect the probability of seizure in the target patient population (i.e., for all patients in the population, regardless of their sex and tumour size)?



When you fit a univariate binary logistic regression model relating your dependent variable (seizure, yes or no) to the independent variable sex, the model is ultimately enabling you to answer this question:



How does sex affect the probability of seizure in the target patient population (i.e., for all patients in the population, regardless of their age and tumour size)?



When you fit a univariate binary logistic regression model relating your dependent variable (seizure, yes or no) to the independent variable tumour size, the model is ultimately enabling you to answer this question:



How does tumour size affect the probability of seizure in the target patient population (i.e., for all patients in the population, regardless of their age and sex)?



When you fit a multiple binary logistic regression model relating your dependent variable (seizure, yes or no) to the independent variables age, sex and tumour size, the model is ultimately enabling you to answer more pointed questions (assuming you only include main effects for these independent variables in your model):




  1. How does age affect the probability of seizure for patients in the target patient population having the same sex and the same tumour size?


  2. How does sex affect the probability of seizure for patients in the target population having the same age and the same tumour size?


  3. How does tumour size affect the probability of seizure for patients in the target population having the same age and the same sex?



Of course, if you include interactions between any of the independent variables in your multiple binary logistic regression model, that expands the list of questions you can ask.






share|cite|improve this answer














For the sake of simplicity, let's say your independent variables consist of age, sex and tumour size only.



When you fit a univariate binary logistic regression model relating your dependent variable (seizure, yes or no) to the independent variable age, the model is ultimately enabling you to answer this question:



How does age affect the probability of seizure in the target patient population (i.e., for all patients in the population, regardless of their sex and tumour size)?



When you fit a univariate binary logistic regression model relating your dependent variable (seizure, yes or no) to the independent variable sex, the model is ultimately enabling you to answer this question:



How does sex affect the probability of seizure in the target patient population (i.e., for all patients in the population, regardless of their age and tumour size)?



When you fit a univariate binary logistic regression model relating your dependent variable (seizure, yes or no) to the independent variable tumour size, the model is ultimately enabling you to answer this question:



How does tumour size affect the probability of seizure in the target patient population (i.e., for all patients in the population, regardless of their age and sex)?



When you fit a multiple binary logistic regression model relating your dependent variable (seizure, yes or no) to the independent variables age, sex and tumour size, the model is ultimately enabling you to answer more pointed questions (assuming you only include main effects for these independent variables in your model):




  1. How does age affect the probability of seizure for patients in the target patient population having the same sex and the same tumour size?


  2. How does sex affect the probability of seizure for patients in the target population having the same age and the same tumour size?


  3. How does tumour size affect the probability of seizure for patients in the target population having the same age and the same sex?



Of course, if you include interactions between any of the independent variables in your multiple binary logistic regression model, that expands the list of questions you can ask.







share|cite|improve this answer














share|cite|improve this answer



share|cite|improve this answer








edited Dec 2 at 16:38

























answered Dec 2 at 16:29









Isabella Ghement

5,664320




5,664320








  • 2




    Thank you. That explains a lot. So when inputting the multivariate analysis in my statistical software (STATA), if I want to answer the question: "How does age affect the probability of seizure for patients in the target patient population having the same sex and the same tumour size)?" Do I select seizure, sex and tumor size as the dependent variables, and age as the independent variable?
    – Paze
    Dec 2 at 16:41






  • 1




    You're welcome! No, you only select seizure as the dependent variable and then include age, sex and tumour size together as the independent variable. The 3 questions I listed in my answers, 1., 2. and 3., can all be answered based on this one model! That's the beauty of multiple binary logistic regression!
    – Isabella Ghement
    Dec 2 at 16:44






  • 1




    Paze, Isabella provided you with some nice guidance, but there are many other factors you'll want to consider to ensure you understand what you are doing. Get a copy of "Applied Logistic Regression" by Hosmer, Lemenshow, and Sturdivant. You'll want to make sure your continuous variables are linear in the log of odds, you'll want to understand why a variable can be statistically significant in a simple regression but insignificant in multiple regression. The book does a great job of taking you from square 1 to proficiency in logistic regression. Statistics is very much garbage in garbage out.
    – ColorStatistics
    Dec 2 at 18:04






  • 1




    And thank you very much Isabella for your invaluable help!
    – Paze
    Dec 2 at 18:37






  • 1




    Grea advice, @ColorStatistics! The only thing I don't agree with is "Statistics is very much garbage in garbage out". I would qualify that statement as follows: "When applied thoughtlessly, statistics can very much be garbage in, garbage out." 😝
    – Isabella Ghement
    Dec 2 at 18:54














  • 2




    Thank you. That explains a lot. So when inputting the multivariate analysis in my statistical software (STATA), if I want to answer the question: "How does age affect the probability of seizure for patients in the target patient population having the same sex and the same tumour size)?" Do I select seizure, sex and tumor size as the dependent variables, and age as the independent variable?
    – Paze
    Dec 2 at 16:41






  • 1




    You're welcome! No, you only select seizure as the dependent variable and then include age, sex and tumour size together as the independent variable. The 3 questions I listed in my answers, 1., 2. and 3., can all be answered based on this one model! That's the beauty of multiple binary logistic regression!
    – Isabella Ghement
    Dec 2 at 16:44






  • 1




    Paze, Isabella provided you with some nice guidance, but there are many other factors you'll want to consider to ensure you understand what you are doing. Get a copy of "Applied Logistic Regression" by Hosmer, Lemenshow, and Sturdivant. You'll want to make sure your continuous variables are linear in the log of odds, you'll want to understand why a variable can be statistically significant in a simple regression but insignificant in multiple regression. The book does a great job of taking you from square 1 to proficiency in logistic regression. Statistics is very much garbage in garbage out.
    – ColorStatistics
    Dec 2 at 18:04






  • 1




    And thank you very much Isabella for your invaluable help!
    – Paze
    Dec 2 at 18:37






  • 1




    Grea advice, @ColorStatistics! The only thing I don't agree with is "Statistics is very much garbage in garbage out". I would qualify that statement as follows: "When applied thoughtlessly, statistics can very much be garbage in, garbage out." 😝
    – Isabella Ghement
    Dec 2 at 18:54








2




2




Thank you. That explains a lot. So when inputting the multivariate analysis in my statistical software (STATA), if I want to answer the question: "How does age affect the probability of seizure for patients in the target patient population having the same sex and the same tumour size)?" Do I select seizure, sex and tumor size as the dependent variables, and age as the independent variable?
– Paze
Dec 2 at 16:41




Thank you. That explains a lot. So when inputting the multivariate analysis in my statistical software (STATA), if I want to answer the question: "How does age affect the probability of seizure for patients in the target patient population having the same sex and the same tumour size)?" Do I select seizure, sex and tumor size as the dependent variables, and age as the independent variable?
– Paze
Dec 2 at 16:41




1




1




You're welcome! No, you only select seizure as the dependent variable and then include age, sex and tumour size together as the independent variable. The 3 questions I listed in my answers, 1., 2. and 3., can all be answered based on this one model! That's the beauty of multiple binary logistic regression!
– Isabella Ghement
Dec 2 at 16:44




You're welcome! No, you only select seizure as the dependent variable and then include age, sex and tumour size together as the independent variable. The 3 questions I listed in my answers, 1., 2. and 3., can all be answered based on this one model! That's the beauty of multiple binary logistic regression!
– Isabella Ghement
Dec 2 at 16:44




1




1




Paze, Isabella provided you with some nice guidance, but there are many other factors you'll want to consider to ensure you understand what you are doing. Get a copy of "Applied Logistic Regression" by Hosmer, Lemenshow, and Sturdivant. You'll want to make sure your continuous variables are linear in the log of odds, you'll want to understand why a variable can be statistically significant in a simple regression but insignificant in multiple regression. The book does a great job of taking you from square 1 to proficiency in logistic regression. Statistics is very much garbage in garbage out.
– ColorStatistics
Dec 2 at 18:04




Paze, Isabella provided you with some nice guidance, but there are many other factors you'll want to consider to ensure you understand what you are doing. Get a copy of "Applied Logistic Regression" by Hosmer, Lemenshow, and Sturdivant. You'll want to make sure your continuous variables are linear in the log of odds, you'll want to understand why a variable can be statistically significant in a simple regression but insignificant in multiple regression. The book does a great job of taking you from square 1 to proficiency in logistic regression. Statistics is very much garbage in garbage out.
– ColorStatistics
Dec 2 at 18:04




1




1




And thank you very much Isabella for your invaluable help!
– Paze
Dec 2 at 18:37




And thank you very much Isabella for your invaluable help!
– Paze
Dec 2 at 18:37




1




1




Grea advice, @ColorStatistics! The only thing I don't agree with is "Statistics is very much garbage in garbage out". I would qualify that statement as follows: "When applied thoughtlessly, statistics can very much be garbage in, garbage out." 😝
– Isabella Ghement
Dec 2 at 18:54




Grea advice, @ColorStatistics! The only thing I don't agree with is "Statistics is very much garbage in garbage out". I would qualify that statement as follows: "When applied thoughtlessly, statistics can very much be garbage in, garbage out." 😝
– Isabella Ghement
Dec 2 at 18:54










Paze is a new contributor. Be nice, and check out our Code of Conduct.










draft saved

draft discarded


















Paze is a new contributor. Be nice, and check out our Code of Conduct.













Paze is a new contributor. Be nice, and check out our Code of Conduct.












Paze is a new contributor. Be nice, and check out our Code of Conduct.
















Thanks for contributing an answer to Cross Validated!


  • Please be sure to answer the question. Provide details and share your research!

But avoid



  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.


Use MathJax to format equations. MathJax reference.


To learn more, see our tips on writing great answers.





Some of your past answers have not been well-received, and you're in danger of being blocked from answering.


Please pay close attention to the following guidance:


  • Please be sure to answer the question. Provide details and share your research!

But avoid



  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f379924%2funivariate-or-multivariate-regression-for-this-project%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

"Incorrect syntax near the keyword 'ON'. (on update cascade, on delete cascade,)

Alcedinidae

Origin of the phrase “under your belt”?