How can I normalize Lucene scores?





.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty{ height:90px;width:728px;box-sizing:border-box;
}







1















I want to normalize the Lucene scores between specific numbers



For example, a query returns the following scores...



0.35773993
0.15979238
0.15305749
0.14774998
0.14689445


And another query returns



2.32223993
1.55979222
1.14705711
0.12773933
0.10689411


How can I normalize the scores?










share|improve this question


















  • 3





    What's your goal with normalization of the scores? Normalization across queries doesn't really make sense (but there's a norm factor in the scoring formula that at least introduces a part for that). Since the last query is different from the first one, the scores returned isn't comparable (i.e. the numeric value doesn't mean any specific).

    – MatsLindh
    Nov 23 '18 at 20:48











  • Because I want to see which query is more relevant to the documents..

    – Noran
    Nov 23 '18 at 21:19











  • But those numbers won't really tell you that - how do you define that a query is more relevant to documents? If it's just the one giving the highest score (since normalizing wouldn't change that, seeing as that would scale the scores within [0, 1] for [0, max] instead), just pick that one. But that number doesn't really tell you that a query is more relevant (.. which would mean exactly what?) to the document.

    – MatsLindh
    Nov 23 '18 at 21:31


















1















I want to normalize the Lucene scores between specific numbers



For example, a query returns the following scores...



0.35773993
0.15979238
0.15305749
0.14774998
0.14689445


And another query returns



2.32223993
1.55979222
1.14705711
0.12773933
0.10689411


How can I normalize the scores?










share|improve this question


















  • 3





    What's your goal with normalization of the scores? Normalization across queries doesn't really make sense (but there's a norm factor in the scoring formula that at least introduces a part for that). Since the last query is different from the first one, the scores returned isn't comparable (i.e. the numeric value doesn't mean any specific).

    – MatsLindh
    Nov 23 '18 at 20:48











  • Because I want to see which query is more relevant to the documents..

    – Noran
    Nov 23 '18 at 21:19











  • But those numbers won't really tell you that - how do you define that a query is more relevant to documents? If it's just the one giving the highest score (since normalizing wouldn't change that, seeing as that would scale the scores within [0, 1] for [0, max] instead), just pick that one. But that number doesn't really tell you that a query is more relevant (.. which would mean exactly what?) to the document.

    – MatsLindh
    Nov 23 '18 at 21:31














1












1








1








I want to normalize the Lucene scores between specific numbers



For example, a query returns the following scores...



0.35773993
0.15979238
0.15305749
0.14774998
0.14689445


And another query returns



2.32223993
1.55979222
1.14705711
0.12773933
0.10689411


How can I normalize the scores?










share|improve this question














I want to normalize the Lucene scores between specific numbers



For example, a query returns the following scores...



0.35773993
0.15979238
0.15305749
0.14774998
0.14689445


And another query returns



2.32223993
1.55979222
1.14705711
0.12773933
0.10689411


How can I normalize the scores?







java apache solr lucene






share|improve this question













share|improve this question











share|improve this question




share|improve this question










asked Nov 23 '18 at 20:45









NoranNoran

249110




249110








  • 3





    What's your goal with normalization of the scores? Normalization across queries doesn't really make sense (but there's a norm factor in the scoring formula that at least introduces a part for that). Since the last query is different from the first one, the scores returned isn't comparable (i.e. the numeric value doesn't mean any specific).

    – MatsLindh
    Nov 23 '18 at 20:48











  • Because I want to see which query is more relevant to the documents..

    – Noran
    Nov 23 '18 at 21:19











  • But those numbers won't really tell you that - how do you define that a query is more relevant to documents? If it's just the one giving the highest score (since normalizing wouldn't change that, seeing as that would scale the scores within [0, 1] for [0, max] instead), just pick that one. But that number doesn't really tell you that a query is more relevant (.. which would mean exactly what?) to the document.

    – MatsLindh
    Nov 23 '18 at 21:31














  • 3





    What's your goal with normalization of the scores? Normalization across queries doesn't really make sense (but there's a norm factor in the scoring formula that at least introduces a part for that). Since the last query is different from the first one, the scores returned isn't comparable (i.e. the numeric value doesn't mean any specific).

    – MatsLindh
    Nov 23 '18 at 20:48











  • Because I want to see which query is more relevant to the documents..

    – Noran
    Nov 23 '18 at 21:19











  • But those numbers won't really tell you that - how do you define that a query is more relevant to documents? If it's just the one giving the highest score (since normalizing wouldn't change that, seeing as that would scale the scores within [0, 1] for [0, max] instead), just pick that one. But that number doesn't really tell you that a query is more relevant (.. which would mean exactly what?) to the document.

    – MatsLindh
    Nov 23 '18 at 21:31








3




3





What's your goal with normalization of the scores? Normalization across queries doesn't really make sense (but there's a norm factor in the scoring formula that at least introduces a part for that). Since the last query is different from the first one, the scores returned isn't comparable (i.e. the numeric value doesn't mean any specific).

– MatsLindh
Nov 23 '18 at 20:48





What's your goal with normalization of the scores? Normalization across queries doesn't really make sense (but there's a norm factor in the scoring formula that at least introduces a part for that). Since the last query is different from the first one, the scores returned isn't comparable (i.e. the numeric value doesn't mean any specific).

– MatsLindh
Nov 23 '18 at 20:48













Because I want to see which query is more relevant to the documents..

– Noran
Nov 23 '18 at 21:19





Because I want to see which query is more relevant to the documents..

– Noran
Nov 23 '18 at 21:19













But those numbers won't really tell you that - how do you define that a query is more relevant to documents? If it's just the one giving the highest score (since normalizing wouldn't change that, seeing as that would scale the scores within [0, 1] for [0, max] instead), just pick that one. But that number doesn't really tell you that a query is more relevant (.. which would mean exactly what?) to the document.

– MatsLindh
Nov 23 '18 at 21:31





But those numbers won't really tell you that - how do you define that a query is more relevant to documents? If it's just the one giving the highest score (since normalizing wouldn't change that, seeing as that would scale the scores within [0, 1] for [0, max] instead), just pick that one. But that number doesn't really tell you that a query is more relevant (.. which would mean exactly what?) to the document.

– MatsLindh
Nov 23 '18 at 21:31












0






active

oldest

votes












Your Answer






StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});


}
});














draft saved

draft discarded


















StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53452757%2fhow-can-i-normalize-lucene-scores%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown

























0






active

oldest

votes








0






active

oldest

votes









active

oldest

votes






active

oldest

votes
















draft saved

draft discarded




















































Thanks for contributing an answer to Stack Overflow!


  • Please be sure to answer the question. Provide details and share your research!

But avoid



  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53452757%2fhow-can-i-normalize-lucene-scores%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

"Incorrect syntax near the keyword 'ON'. (on update cascade, on delete cascade,)

Alcedinidae

Origin of the phrase “under your belt”?