Is a model fitted to data or is data fitted to a model?
$begingroup$
Is there a conceptual or procedural difference between fitting a model to data and fitting data to model? An example of the first wording can be seen in https://courses.washington.edu/matlab1/ModelFitting.html, and of the second in https://reference.wolfram.com/applications/eda/FittingDataToLinearModelsByLeast-SquaresTechniques.html.
terminology
$endgroup$
add a comment |
$begingroup$
Is there a conceptual or procedural difference between fitting a model to data and fitting data to model? An example of the first wording can be seen in https://courses.washington.edu/matlab1/ModelFitting.html, and of the second in https://reference.wolfram.com/applications/eda/FittingDataToLinearModelsByLeast-SquaresTechniques.html.
terminology
$endgroup$
7
$begingroup$
+1 I am not impressed by the second link, but I am entertained.
$endgroup$
– The Laconic
Mar 24 at 2:38
$begingroup$
Many models fits current data, but data typically fits best one model
$endgroup$
– Agnius Vasiliauskas
Mar 25 at 9:45
add a comment |
$begingroup$
Is there a conceptual or procedural difference between fitting a model to data and fitting data to model? An example of the first wording can be seen in https://courses.washington.edu/matlab1/ModelFitting.html, and of the second in https://reference.wolfram.com/applications/eda/FittingDataToLinearModelsByLeast-SquaresTechniques.html.
terminology
$endgroup$
Is there a conceptual or procedural difference between fitting a model to data and fitting data to model? An example of the first wording can be seen in https://courses.washington.edu/matlab1/ModelFitting.html, and of the second in https://reference.wolfram.com/applications/eda/FittingDataToLinearModelsByLeast-SquaresTechniques.html.
terminology
terminology
edited Mar 24 at 8:18
Nick Cox
39.1k587131
39.1k587131
asked Mar 24 at 2:20
enjayesenjayes
1187
1187
7
$begingroup$
+1 I am not impressed by the second link, but I am entertained.
$endgroup$
– The Laconic
Mar 24 at 2:38
$begingroup$
Many models fits current data, but data typically fits best one model
$endgroup$
– Agnius Vasiliauskas
Mar 25 at 9:45
add a comment |
7
$begingroup$
+1 I am not impressed by the second link, but I am entertained.
$endgroup$
– The Laconic
Mar 24 at 2:38
$begingroup$
Many models fits current data, but data typically fits best one model
$endgroup$
– Agnius Vasiliauskas
Mar 25 at 9:45
7
7
$begingroup$
+1 I am not impressed by the second link, but I am entertained.
$endgroup$
– The Laconic
Mar 24 at 2:38
$begingroup$
+1 I am not impressed by the second link, but I am entertained.
$endgroup$
– The Laconic
Mar 24 at 2:38
$begingroup$
Many models fits current data, but data typically fits best one model
$endgroup$
– Agnius Vasiliauskas
Mar 25 at 9:45
$begingroup$
Many models fits current data, but data typically fits best one model
$endgroup$
– Agnius Vasiliauskas
Mar 25 at 9:45
add a comment |
4 Answers
4
active
oldest
votes
$begingroup$
Pretty much every source or person I've ever interacted with except the Wolfram source you linked refers to the process as fitting a model to data. This makes sense, since the model is the dynamic object and the data is static (a.k.a. fixed and constant).
To put a point on it, I like Larry Wasserman's approach to this. In his telling, a statistical model is a collection of distributions. For example, the collection of all normal distributions:
$$ { text{Normal}(mu, sigma) : mu, sigma in R, sigma > 0 } $$
or the set of all Poisson distributions:
$$ { text{Poisson}(lambda) : lambda in R, lambda > 0 } $$
Fitting a distribution to data is any algorithm that combines a statistical model with a set of data (the data is fixed), and chooses exactly one of the distributions from the model as the one that "best" reflects the data.
The model is the thing that changes (sort of): we are collapsing it from an entire collection of possibilities into a single best choice. The data is just the data; nothing happens to it at all.
$endgroup$
add a comment |
$begingroup$
In the field of Rasch modelling it is common to fit the data to the model. The model is assumed to be correct and it is the analyst's job to find data which conform to it. The Wikipedia article on Rasch contains more details about the how and the why.
But I agree with others that in general in statistics we fit the model to the data because we can change the model but it is felt to be bad form to select or modify the data.
$endgroup$
add a comment |
$begingroup$
Typically, the observed data are fixed while the model is mutable (e.g. because parameters are estimated), so it is the model that is made to fit the data, not the other way around. (Usually people mean this case when they say either expression.)
When people say they fit data to a model I find myself trying to figure out what the heck did they do to the data?.
[Now if you're transforming data, that would arguably be 'fitting data to a model', but people almost never say that for this case.]
$endgroup$
5
$begingroup$
Removing outliers would also (arguably) be "fitting data to a model".
$endgroup$
– Federico Poloni
Mar 24 at 8:34
1
$begingroup$
The phrasing might make sense if they're thinking of it as "fitting (data to a model)". That is, you're doing a process of fitting, and that process of fitting starts from data and transforms it to a model. I agree that's a less common/accurate interpretation versus the "(fitting X) to Y" parse, but I put it out there as a rationale as to why someone might logically say it.
$endgroup$
– R.M.
Mar 24 at 13:29
1
$begingroup$
@FedericoPoloni Outliers are usually defined indepedently of the model that you later want to use. So even if we would want to call it fitting data, it would not be a model, but to something else.
$endgroup$
– BartoszKP
Mar 24 at 20:16
1
$begingroup$
+1. There is a reason it's called "data" - it is what is given, see the Latin origin of the word: latindictionary.wikidot.com/verb:dare
$endgroup$
– Christoph Hanck
Mar 25 at 15:37
add a comment |
$begingroup$
Usually, we assume our data corresponds to the "real world" and making any modifications means we are moving away from modelling the "real world". For example, one needs to take care removing outliers since even if it makes computation nicer, outliers were still part of our data.
When testing a model or estimating properties of an estimator using bootstrap or other resampling techniques, we may simulate new data using an estimated model and our original data. This makes the assumption that the model is correct, and we are not modifying our original data.
$endgroup$
add a comment |
StackExchange.ifUsing("editor", function () {
return StackExchange.using("mathjaxEditing", function () {
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
});
});
}, "mathjax-editing");
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "65"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f399094%2fis-a-model-fitted-to-data-or-is-data-fitted-to-a-model%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
4 Answers
4
active
oldest
votes
4 Answers
4
active
oldest
votes
active
oldest
votes
active
oldest
votes
$begingroup$
Pretty much every source or person I've ever interacted with except the Wolfram source you linked refers to the process as fitting a model to data. This makes sense, since the model is the dynamic object and the data is static (a.k.a. fixed and constant).
To put a point on it, I like Larry Wasserman's approach to this. In his telling, a statistical model is a collection of distributions. For example, the collection of all normal distributions:
$$ { text{Normal}(mu, sigma) : mu, sigma in R, sigma > 0 } $$
or the set of all Poisson distributions:
$$ { text{Poisson}(lambda) : lambda in R, lambda > 0 } $$
Fitting a distribution to data is any algorithm that combines a statistical model with a set of data (the data is fixed), and chooses exactly one of the distributions from the model as the one that "best" reflects the data.
The model is the thing that changes (sort of): we are collapsing it from an entire collection of possibilities into a single best choice. The data is just the data; nothing happens to it at all.
$endgroup$
add a comment |
$begingroup$
Pretty much every source or person I've ever interacted with except the Wolfram source you linked refers to the process as fitting a model to data. This makes sense, since the model is the dynamic object and the data is static (a.k.a. fixed and constant).
To put a point on it, I like Larry Wasserman's approach to this. In his telling, a statistical model is a collection of distributions. For example, the collection of all normal distributions:
$$ { text{Normal}(mu, sigma) : mu, sigma in R, sigma > 0 } $$
or the set of all Poisson distributions:
$$ { text{Poisson}(lambda) : lambda in R, lambda > 0 } $$
Fitting a distribution to data is any algorithm that combines a statistical model with a set of data (the data is fixed), and chooses exactly one of the distributions from the model as the one that "best" reflects the data.
The model is the thing that changes (sort of): we are collapsing it from an entire collection of possibilities into a single best choice. The data is just the data; nothing happens to it at all.
$endgroup$
add a comment |
$begingroup$
Pretty much every source or person I've ever interacted with except the Wolfram source you linked refers to the process as fitting a model to data. This makes sense, since the model is the dynamic object and the data is static (a.k.a. fixed and constant).
To put a point on it, I like Larry Wasserman's approach to this. In his telling, a statistical model is a collection of distributions. For example, the collection of all normal distributions:
$$ { text{Normal}(mu, sigma) : mu, sigma in R, sigma > 0 } $$
or the set of all Poisson distributions:
$$ { text{Poisson}(lambda) : lambda in R, lambda > 0 } $$
Fitting a distribution to data is any algorithm that combines a statistical model with a set of data (the data is fixed), and chooses exactly one of the distributions from the model as the one that "best" reflects the data.
The model is the thing that changes (sort of): we are collapsing it from an entire collection of possibilities into a single best choice. The data is just the data; nothing happens to it at all.
$endgroup$
Pretty much every source or person I've ever interacted with except the Wolfram source you linked refers to the process as fitting a model to data. This makes sense, since the model is the dynamic object and the data is static (a.k.a. fixed and constant).
To put a point on it, I like Larry Wasserman's approach to this. In his telling, a statistical model is a collection of distributions. For example, the collection of all normal distributions:
$$ { text{Normal}(mu, sigma) : mu, sigma in R, sigma > 0 } $$
or the set of all Poisson distributions:
$$ { text{Poisson}(lambda) : lambda in R, lambda > 0 } $$
Fitting a distribution to data is any algorithm that combines a statistical model with a set of data (the data is fixed), and chooses exactly one of the distributions from the model as the one that "best" reflects the data.
The model is the thing that changes (sort of): we are collapsing it from an entire collection of possibilities into a single best choice. The data is just the data; nothing happens to it at all.
edited Mar 25 at 23:55
answered Mar 24 at 4:44
Matthew DruryMatthew Drury
26.9k267107
26.9k267107
add a comment |
add a comment |
$begingroup$
In the field of Rasch modelling it is common to fit the data to the model. The model is assumed to be correct and it is the analyst's job to find data which conform to it. The Wikipedia article on Rasch contains more details about the how and the why.
But I agree with others that in general in statistics we fit the model to the data because we can change the model but it is felt to be bad form to select or modify the data.
$endgroup$
add a comment |
$begingroup$
In the field of Rasch modelling it is common to fit the data to the model. The model is assumed to be correct and it is the analyst's job to find data which conform to it. The Wikipedia article on Rasch contains more details about the how and the why.
But I agree with others that in general in statistics we fit the model to the data because we can change the model but it is felt to be bad form to select or modify the data.
$endgroup$
add a comment |
$begingroup$
In the field of Rasch modelling it is common to fit the data to the model. The model is assumed to be correct and it is the analyst's job to find data which conform to it. The Wikipedia article on Rasch contains more details about the how and the why.
But I agree with others that in general in statistics we fit the model to the data because we can change the model but it is felt to be bad form to select or modify the data.
$endgroup$
In the field of Rasch modelling it is common to fit the data to the model. The model is assumed to be correct and it is the analyst's job to find data which conform to it. The Wikipedia article on Rasch contains more details about the how and the why.
But I agree with others that in general in statistics we fit the model to the data because we can change the model but it is felt to be bad form to select or modify the data.
answered Mar 24 at 14:39
mdeweymdewey
12.5k72344
12.5k72344
add a comment |
add a comment |
$begingroup$
Typically, the observed data are fixed while the model is mutable (e.g. because parameters are estimated), so it is the model that is made to fit the data, not the other way around. (Usually people mean this case when they say either expression.)
When people say they fit data to a model I find myself trying to figure out what the heck did they do to the data?.
[Now if you're transforming data, that would arguably be 'fitting data to a model', but people almost never say that for this case.]
$endgroup$
5
$begingroup$
Removing outliers would also (arguably) be "fitting data to a model".
$endgroup$
– Federico Poloni
Mar 24 at 8:34
1
$begingroup$
The phrasing might make sense if they're thinking of it as "fitting (data to a model)". That is, you're doing a process of fitting, and that process of fitting starts from data and transforms it to a model. I agree that's a less common/accurate interpretation versus the "(fitting X) to Y" parse, but I put it out there as a rationale as to why someone might logically say it.
$endgroup$
– R.M.
Mar 24 at 13:29
1
$begingroup$
@FedericoPoloni Outliers are usually defined indepedently of the model that you later want to use. So even if we would want to call it fitting data, it would not be a model, but to something else.
$endgroup$
– BartoszKP
Mar 24 at 20:16
1
$begingroup$
+1. There is a reason it's called "data" - it is what is given, see the Latin origin of the word: latindictionary.wikidot.com/verb:dare
$endgroup$
– Christoph Hanck
Mar 25 at 15:37
add a comment |
$begingroup$
Typically, the observed data are fixed while the model is mutable (e.g. because parameters are estimated), so it is the model that is made to fit the data, not the other way around. (Usually people mean this case when they say either expression.)
When people say they fit data to a model I find myself trying to figure out what the heck did they do to the data?.
[Now if you're transforming data, that would arguably be 'fitting data to a model', but people almost never say that for this case.]
$endgroup$
5
$begingroup$
Removing outliers would also (arguably) be "fitting data to a model".
$endgroup$
– Federico Poloni
Mar 24 at 8:34
1
$begingroup$
The phrasing might make sense if they're thinking of it as "fitting (data to a model)". That is, you're doing a process of fitting, and that process of fitting starts from data and transforms it to a model. I agree that's a less common/accurate interpretation versus the "(fitting X) to Y" parse, but I put it out there as a rationale as to why someone might logically say it.
$endgroup$
– R.M.
Mar 24 at 13:29
1
$begingroup$
@FedericoPoloni Outliers are usually defined indepedently of the model that you later want to use. So even if we would want to call it fitting data, it would not be a model, but to something else.
$endgroup$
– BartoszKP
Mar 24 at 20:16
1
$begingroup$
+1. There is a reason it's called "data" - it is what is given, see the Latin origin of the word: latindictionary.wikidot.com/verb:dare
$endgroup$
– Christoph Hanck
Mar 25 at 15:37
add a comment |
$begingroup$
Typically, the observed data are fixed while the model is mutable (e.g. because parameters are estimated), so it is the model that is made to fit the data, not the other way around. (Usually people mean this case when they say either expression.)
When people say they fit data to a model I find myself trying to figure out what the heck did they do to the data?.
[Now if you're transforming data, that would arguably be 'fitting data to a model', but people almost never say that for this case.]
$endgroup$
Typically, the observed data are fixed while the model is mutable (e.g. because parameters are estimated), so it is the model that is made to fit the data, not the other way around. (Usually people mean this case when they say either expression.)
When people say they fit data to a model I find myself trying to figure out what the heck did they do to the data?.
[Now if you're transforming data, that would arguably be 'fitting data to a model', but people almost never say that for this case.]
answered Mar 24 at 8:13
Glen_b♦Glen_b
214k23417770
214k23417770
5
$begingroup$
Removing outliers would also (arguably) be "fitting data to a model".
$endgroup$
– Federico Poloni
Mar 24 at 8:34
1
$begingroup$
The phrasing might make sense if they're thinking of it as "fitting (data to a model)". That is, you're doing a process of fitting, and that process of fitting starts from data and transforms it to a model. I agree that's a less common/accurate interpretation versus the "(fitting X) to Y" parse, but I put it out there as a rationale as to why someone might logically say it.
$endgroup$
– R.M.
Mar 24 at 13:29
1
$begingroup$
@FedericoPoloni Outliers are usually defined indepedently of the model that you later want to use. So even if we would want to call it fitting data, it would not be a model, but to something else.
$endgroup$
– BartoszKP
Mar 24 at 20:16
1
$begingroup$
+1. There is a reason it's called "data" - it is what is given, see the Latin origin of the word: latindictionary.wikidot.com/verb:dare
$endgroup$
– Christoph Hanck
Mar 25 at 15:37
add a comment |
5
$begingroup$
Removing outliers would also (arguably) be "fitting data to a model".
$endgroup$
– Federico Poloni
Mar 24 at 8:34
1
$begingroup$
The phrasing might make sense if they're thinking of it as "fitting (data to a model)". That is, you're doing a process of fitting, and that process of fitting starts from data and transforms it to a model. I agree that's a less common/accurate interpretation versus the "(fitting X) to Y" parse, but I put it out there as a rationale as to why someone might logically say it.
$endgroup$
– R.M.
Mar 24 at 13:29
1
$begingroup$
@FedericoPoloni Outliers are usually defined indepedently of the model that you later want to use. So even if we would want to call it fitting data, it would not be a model, but to something else.
$endgroup$
– BartoszKP
Mar 24 at 20:16
1
$begingroup$
+1. There is a reason it's called "data" - it is what is given, see the Latin origin of the word: latindictionary.wikidot.com/verb:dare
$endgroup$
– Christoph Hanck
Mar 25 at 15:37
5
5
$begingroup$
Removing outliers would also (arguably) be "fitting data to a model".
$endgroup$
– Federico Poloni
Mar 24 at 8:34
$begingroup$
Removing outliers would also (arguably) be "fitting data to a model".
$endgroup$
– Federico Poloni
Mar 24 at 8:34
1
1
$begingroup$
The phrasing might make sense if they're thinking of it as "fitting (data to a model)". That is, you're doing a process of fitting, and that process of fitting starts from data and transforms it to a model. I agree that's a less common/accurate interpretation versus the "(fitting X) to Y" parse, but I put it out there as a rationale as to why someone might logically say it.
$endgroup$
– R.M.
Mar 24 at 13:29
$begingroup$
The phrasing might make sense if they're thinking of it as "fitting (data to a model)". That is, you're doing a process of fitting, and that process of fitting starts from data and transforms it to a model. I agree that's a less common/accurate interpretation versus the "(fitting X) to Y" parse, but I put it out there as a rationale as to why someone might logically say it.
$endgroup$
– R.M.
Mar 24 at 13:29
1
1
$begingroup$
@FedericoPoloni Outliers are usually defined indepedently of the model that you later want to use. So even if we would want to call it fitting data, it would not be a model, but to something else.
$endgroup$
– BartoszKP
Mar 24 at 20:16
$begingroup$
@FedericoPoloni Outliers are usually defined indepedently of the model that you later want to use. So even if we would want to call it fitting data, it would not be a model, but to something else.
$endgroup$
– BartoszKP
Mar 24 at 20:16
1
1
$begingroup$
+1. There is a reason it's called "data" - it is what is given, see the Latin origin of the word: latindictionary.wikidot.com/verb:dare
$endgroup$
– Christoph Hanck
Mar 25 at 15:37
$begingroup$
+1. There is a reason it's called "data" - it is what is given, see the Latin origin of the word: latindictionary.wikidot.com/verb:dare
$endgroup$
– Christoph Hanck
Mar 25 at 15:37
add a comment |
$begingroup$
Usually, we assume our data corresponds to the "real world" and making any modifications means we are moving away from modelling the "real world". For example, one needs to take care removing outliers since even if it makes computation nicer, outliers were still part of our data.
When testing a model or estimating properties of an estimator using bootstrap or other resampling techniques, we may simulate new data using an estimated model and our original data. This makes the assumption that the model is correct, and we are not modifying our original data.
$endgroup$
add a comment |
$begingroup$
Usually, we assume our data corresponds to the "real world" and making any modifications means we are moving away from modelling the "real world". For example, one needs to take care removing outliers since even if it makes computation nicer, outliers were still part of our data.
When testing a model or estimating properties of an estimator using bootstrap or other resampling techniques, we may simulate new data using an estimated model and our original data. This makes the assumption that the model is correct, and we are not modifying our original data.
$endgroup$
add a comment |
$begingroup$
Usually, we assume our data corresponds to the "real world" and making any modifications means we are moving away from modelling the "real world". For example, one needs to take care removing outliers since even if it makes computation nicer, outliers were still part of our data.
When testing a model or estimating properties of an estimator using bootstrap or other resampling techniques, we may simulate new data using an estimated model and our original data. This makes the assumption that the model is correct, and we are not modifying our original data.
$endgroup$
Usually, we assume our data corresponds to the "real world" and making any modifications means we are moving away from modelling the "real world". For example, one needs to take care removing outliers since even if it makes computation nicer, outliers were still part of our data.
When testing a model or estimating properties of an estimator using bootstrap or other resampling techniques, we may simulate new data using an estimated model and our original data. This makes the assumption that the model is correct, and we are not modifying our original data.
answered Mar 24 at 21:34
qwrqwr
250112
250112
add a comment |
add a comment |
Thanks for contributing an answer to Cross Validated!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f399094%2fis-a-model-fitted-to-data-or-is-data-fitted-to-a-model%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
7
$begingroup$
+1 I am not impressed by the second link, but I am entertained.
$endgroup$
– The Laconic
Mar 24 at 2:38
$begingroup$
Many models fits current data, but data typically fits best one model
$endgroup$
– Agnius Vasiliauskas
Mar 25 at 9:45