How can I normalize Lucene scores?
.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty{ height:90px;width:728px;box-sizing:border-box;
}
I want to normalize the Lucene scores between specific numbers
For example, a query returns the following scores...
0.35773993
0.15979238
0.15305749
0.14774998
0.14689445
And another query returns
2.32223993
1.55979222
1.14705711
0.12773933
0.10689411
How can I normalize the scores?
java apache solr lucene
add a comment |
I want to normalize the Lucene scores between specific numbers
For example, a query returns the following scores...
0.35773993
0.15979238
0.15305749
0.14774998
0.14689445
And another query returns
2.32223993
1.55979222
1.14705711
0.12773933
0.10689411
How can I normalize the scores?
java apache solr lucene
3
What's your goal with normalization of the scores? Normalization across queries doesn't really make sense (but there's a norm factor in the scoring formula that at least introduces a part for that). Since the last query is different from the first one, the scores returned isn't comparable (i.e. the numeric value doesn't mean any specific).
– MatsLindh
Nov 23 '18 at 20:48
Because I want to see which query is more relevant to the documents..
– Noran
Nov 23 '18 at 21:19
But those numbers won't really tell you that - how do you define that a query is more relevant to documents? If it's just the one giving the highest score (since normalizing wouldn't change that, seeing as that would scale the scores within [0, 1] for [0, max] instead), just pick that one. But that number doesn't really tell you that a query is more relevant (.. which would mean exactly what?) to the document.
– MatsLindh
Nov 23 '18 at 21:31
add a comment |
I want to normalize the Lucene scores between specific numbers
For example, a query returns the following scores...
0.35773993
0.15979238
0.15305749
0.14774998
0.14689445
And another query returns
2.32223993
1.55979222
1.14705711
0.12773933
0.10689411
How can I normalize the scores?
java apache solr lucene
I want to normalize the Lucene scores between specific numbers
For example, a query returns the following scores...
0.35773993
0.15979238
0.15305749
0.14774998
0.14689445
And another query returns
2.32223993
1.55979222
1.14705711
0.12773933
0.10689411
How can I normalize the scores?
java apache solr lucene
java apache solr lucene
asked Nov 23 '18 at 20:45
NoranNoran
249110
249110
3
What's your goal with normalization of the scores? Normalization across queries doesn't really make sense (but there's a norm factor in the scoring formula that at least introduces a part for that). Since the last query is different from the first one, the scores returned isn't comparable (i.e. the numeric value doesn't mean any specific).
– MatsLindh
Nov 23 '18 at 20:48
Because I want to see which query is more relevant to the documents..
– Noran
Nov 23 '18 at 21:19
But those numbers won't really tell you that - how do you define that a query is more relevant to documents? If it's just the one giving the highest score (since normalizing wouldn't change that, seeing as that would scale the scores within [0, 1] for [0, max] instead), just pick that one. But that number doesn't really tell you that a query is more relevant (.. which would mean exactly what?) to the document.
– MatsLindh
Nov 23 '18 at 21:31
add a comment |
3
What's your goal with normalization of the scores? Normalization across queries doesn't really make sense (but there's a norm factor in the scoring formula that at least introduces a part for that). Since the last query is different from the first one, the scores returned isn't comparable (i.e. the numeric value doesn't mean any specific).
– MatsLindh
Nov 23 '18 at 20:48
Because I want to see which query is more relevant to the documents..
– Noran
Nov 23 '18 at 21:19
But those numbers won't really tell you that - how do you define that a query is more relevant to documents? If it's just the one giving the highest score (since normalizing wouldn't change that, seeing as that would scale the scores within [0, 1] for [0, max] instead), just pick that one. But that number doesn't really tell you that a query is more relevant (.. which would mean exactly what?) to the document.
– MatsLindh
Nov 23 '18 at 21:31
3
3
What's your goal with normalization of the scores? Normalization across queries doesn't really make sense (but there's a norm factor in the scoring formula that at least introduces a part for that). Since the last query is different from the first one, the scores returned isn't comparable (i.e. the numeric value doesn't mean any specific).
– MatsLindh
Nov 23 '18 at 20:48
What's your goal with normalization of the scores? Normalization across queries doesn't really make sense (but there's a norm factor in the scoring formula that at least introduces a part for that). Since the last query is different from the first one, the scores returned isn't comparable (i.e. the numeric value doesn't mean any specific).
– MatsLindh
Nov 23 '18 at 20:48
Because I want to see which query is more relevant to the documents..
– Noran
Nov 23 '18 at 21:19
Because I want to see which query is more relevant to the documents..
– Noran
Nov 23 '18 at 21:19
But those numbers won't really tell you that - how do you define that a query is more relevant to documents? If it's just the one giving the highest score (since normalizing wouldn't change that, seeing as that would scale the scores within [0, 1] for [0, max] instead), just pick that one. But that number doesn't really tell you that a query is more relevant (.. which would mean exactly what?) to the document.
– MatsLindh
Nov 23 '18 at 21:31
But those numbers won't really tell you that - how do you define that a query is more relevant to documents? If it's just the one giving the highest score (since normalizing wouldn't change that, seeing as that would scale the scores within [0, 1] for [0, max] instead), just pick that one. But that number doesn't really tell you that a query is more relevant (.. which would mean exactly what?) to the document.
– MatsLindh
Nov 23 '18 at 21:31
add a comment |
0
active
oldest
votes
Your Answer
StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53452757%2fhow-can-i-normalize-lucene-scores%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
0
active
oldest
votes
0
active
oldest
votes
active
oldest
votes
active
oldest
votes
Thanks for contributing an answer to Stack Overflow!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53452757%2fhow-can-i-normalize-lucene-scores%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
3
What's your goal with normalization of the scores? Normalization across queries doesn't really make sense (but there's a norm factor in the scoring formula that at least introduces a part for that). Since the last query is different from the first one, the scores returned isn't comparable (i.e. the numeric value doesn't mean any specific).
– MatsLindh
Nov 23 '18 at 20:48
Because I want to see which query is more relevant to the documents..
– Noran
Nov 23 '18 at 21:19
But those numbers won't really tell you that - how do you define that a query is more relevant to documents? If it's just the one giving the highest score (since normalizing wouldn't change that, seeing as that would scale the scores within [0, 1] for [0, max] instead), just pick that one. But that number doesn't really tell you that a query is more relevant (.. which would mean exactly what?) to the document.
– MatsLindh
Nov 23 '18 at 21:31