Import cosine similarities into a clustering method [Python 3]
up vote
-1
down vote
favorite
I am currently having a dilemma of how to use the cosine similarities into a clustering method. What I did was take the public firms in the US for a given year, get their business description and then computed the cosine similarity between the documents. For example, this is what a portion of the data looks like:
Now, the next step is to group those companies into groups based on the similarity between them. However, I don't know how to put the similarity scores to use. I looked at a few tutorials online on how to use clustering algorithms in python with Scikit-learn, but from what I have seen, they had an X and Y variable for a data point, while I have one measure between 2 variables (company 1 and company 2). If anyone can point me in the right direct or has some insight about what I should do please share. Much appreciated! Thank you for you time!
If you have any questions please let me know
python-3.x cluster-analysis hierarchical-clustering cosine-similarity
add a comment |
up vote
-1
down vote
favorite
I am currently having a dilemma of how to use the cosine similarities into a clustering method. What I did was take the public firms in the US for a given year, get their business description and then computed the cosine similarity between the documents. For example, this is what a portion of the data looks like:
Now, the next step is to group those companies into groups based on the similarity between them. However, I don't know how to put the similarity scores to use. I looked at a few tutorials online on how to use clustering algorithms in python with Scikit-learn, but from what I have seen, they had an X and Y variable for a data point, while I have one measure between 2 variables (company 1 and company 2). If anyone can point me in the right direct or has some insight about what I should do please share. Much appreciated! Thank you for you time!
If you have any questions please let me know
python-3.x cluster-analysis hierarchical-clustering cosine-similarity
add a comment |
up vote
-1
down vote
favorite
up vote
-1
down vote
favorite
I am currently having a dilemma of how to use the cosine similarities into a clustering method. What I did was take the public firms in the US for a given year, get their business description and then computed the cosine similarity between the documents. For example, this is what a portion of the data looks like:
Now, the next step is to group those companies into groups based on the similarity between them. However, I don't know how to put the similarity scores to use. I looked at a few tutorials online on how to use clustering algorithms in python with Scikit-learn, but from what I have seen, they had an X and Y variable for a data point, while I have one measure between 2 variables (company 1 and company 2). If anyone can point me in the right direct or has some insight about what I should do please share. Much appreciated! Thank you for you time!
If you have any questions please let me know
python-3.x cluster-analysis hierarchical-clustering cosine-similarity
I am currently having a dilemma of how to use the cosine similarities into a clustering method. What I did was take the public firms in the US for a given year, get their business description and then computed the cosine similarity between the documents. For example, this is what a portion of the data looks like:
Now, the next step is to group those companies into groups based on the similarity between them. However, I don't know how to put the similarity scores to use. I looked at a few tutorials online on how to use clustering algorithms in python with Scikit-learn, but from what I have seen, they had an X and Y variable for a data point, while I have one measure between 2 variables (company 1 and company 2). If anyone can point me in the right direct or has some insight about what I should do please share. Much appreciated! Thank you for you time!
If you have any questions please let me know
python-3.x cluster-analysis hierarchical-clustering cosine-similarity
python-3.x cluster-analysis hierarchical-clustering cosine-similarity
asked Nov 18 at 13:45
Adrian
678
678
add a comment |
add a comment |
active
oldest
votes
active
oldest
votes
active
oldest
votes
active
oldest
votes
active
oldest
votes
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53361556%2fimport-cosine-similarities-into-a-clustering-method-python-3%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown