MS Excel sort column containing non-latin characters
Is it possible to sort a column containing text strings in Excel when the script is non-Latin, e.g. Cyrillic?
If so, how?
+------------------------------------------------+------------------------------------------------+
| CORRECT ORDER | A-Z SORT IN EXCEL |
+------------------------------------------------+------------------------------------------------+
| 3M – Шинэ Зеланд | 3M – Шинэ Зеланд |
| Бристол Майэрз – Энэтхэг | Koника Минолта – Австрали |
| Бупа Эрүүл мэндийн даатгал – Tайланд | Maэрск Шипинг – Шинэ Зеланд |
| Бхарти Телевенчерз Лтд. – Энэтхэг | Moторола – Энэтхэг |
| ГлаксоСмитКлайн – Шинэ Зеланд | Oлимпус Оптикал – Япон |
| Ди Эйч Эл – Австрали | Toёота Файнэншл – Австрали |
| Жeнeрaл Moторз – Энэтхэг | Tайкo Хэлткэйр- Сингапур |
| Жэй-Ви-Си – Япон | Бристол Майэрз – Энэтхэг |
| Инграм Микро – Австрали | Бупа Эрүүл мэндийн даатгал – Tайланд |
| Koника Минолта – Австрали | Бхарти Телевенчерз Лтд. – Энэтхэг |
| Кап Жемини – Энэтхэг | ГлаксоСмитКлайн – Шинэ Зеланд |
| Ковансис компани – Энэтхэг | Ди Эйч Эл – Австрали |
| Лексмарк – Австрали | Жeнeрaл Moторз – Энэтхэг |
| Maэрск Шипинг – Шинэ Зеланд | Жэй-Ви-Си – Япон |
| Moторола – Энэтхэг | Инграм Микро – Австрали |
| Нeстле Глобал – Австрали | Кап Жемини – Энэтхэг |
| Нокиа – Япон | Ковансис компани – Энэтхэг |
| Oлимпус Оптикал – Япон | Лексмарк – Австрали |
| Рийдерз Дайжест – Австрали | Нeстле Глобал – Австрали |
| Си Ай Жи Ай Инк – Филиппин | Нокиа – Япон |
| Стандард энд Пуэрз – Япон | Рийдерз Дайжест – Австрали |
| Статистикийн товчоо – Австрали | Си Ай Жи Ай Инк – Филиппин |
| Toёота Файнэншл – Австрали | Стандард энд Пуэрз – Япон |
| Tайкo Хэлткэйр- Сингапур | Статистикийн товчоо – Австрали |
| Федерал зочид буудал, амралты газар – Австрали | Федерал зочид буудал, амралты газар – Австрали |
| Форд – Австрали | Форд – Австрали |
| Хана семикондактор – Тайланд | Хана семикондактор – Тайланд |
| Хэсс ойл энд газ – Maлайз | Хэсс ойл энд газ – Maлайз |
| Эй Би Эн Aмро – Австрали | Эй Би Эн Aмро – Австрали |
| Эй Эм Ди – Сингапур | Эй Эм Ди – Сингапур |
+------------------------------------------------+------------------------------------------------+
microsoft-excel unicode multilingual
2
Excel will sort data in Cyrillic or other scripts without complaint. Or do you have a question about exactly how this sort occurs, or a specific claim that it is doing it incorrectly? Please provide a specific example.
– Madball73
Apr 30 '14 at 15:44
The problem occurred because the character sets were mixed, which was hard to spot: the Latin K was used instead of the Cyrillic К (alt-01050).
– Loopo
May 1 '14 at 8:26
So, is it fixed now if you replace the Latin K with the correct symbol?
– Madball73
May 1 '14 at 11:58
edited May 1 '14 at 8:59
asked Apr 30 '14 at 15:28
Loopo
1 Answer
IMHO, it is possible to sort. First, detect and replace all the 'latinized' Cyrillic letters. Say your list starts at cell A2: put =LEFT(A2) in F2, =CODE(F2) in G2 and =UNICODE(F2) in H2, then drag the formulas downwards.
If you see the same value in columns G and H, the character is actually Latin rather than Cyrillic. E.g. a genuine Cyrillic К shows a UNICODE() of 1050 (the alt-01050 mentioned in the comments; 922 would be the Greek kappa) and, on a system whose ANSI code page has no Cyrillic, a CODE() of 63, the "?" placeholder. The one in your list shows 75 for both, so it is the Latin K.
That is how you can detect whether the letters are the correct ones. (AFAIK, the letters M, O and T in your list are also latinized.) Note that =LEFT(A2) only tests the first character; Latin letters further inside a word (the list has several) need =MID(A2, n, 1) instead.
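The same detect-and-replace idea can be scripted outside Excel. The following is a minimal sketch in Python, not part of the original answer: the homoglyph table and the function names (`script_of`, `is_mixed`, `fix_text`) are assumptions made for illustration, and the table covers only the most common look-alike letters. It replaces Latin letters only inside words that also contain Cyrillic letters, so a name such as 3M is left alone.

```python
import unicodedata

# Hypothetical homoglyph table: Latin letter -> look-alike Cyrillic letter.
# Escapes are used so the two scripts cannot be confused in the source.
LATIN_TO_CYRILLIC = str.maketrans({
    "A": "\u0410", "B": "\u0412", "C": "\u0421", "E": "\u0415",
    "H": "\u041d", "K": "\u041a", "M": "\u041c", "O": "\u041e",
    "P": "\u0420", "T": "\u0422", "X": "\u0425",
    "a": "\u0430", "c": "\u0441", "e": "\u0435", "o": "\u043e",
    "p": "\u0440", "x": "\u0445", "y": "\u0443",
})


def script_of(ch):
    """Return 'Latin', 'Cyrillic', or None, based on the Unicode character name."""
    name = unicodedata.name(ch, "")
    if name.startswith("LATIN"):
        return "Latin"
    if name.startswith("CYRILLIC"):
        return "Cyrillic"
    return None  # digits, punctuation, spaces, other scripts


def is_mixed(word):
    """True if the word contains both Latin and Cyrillic letters."""
    scripts = {script_of(ch) for ch in word} - {None}
    return len(scripts) > 1


def fix_text(text):
    """Replace Latin look-alikes, but only in words that mix both scripts."""
    words = text.split(" ")
    fixed = [w.translate(LATIN_TO_CYRILLIC) if is_mixed(w) else w for w in words]
    return " ".join(fixed)


if __name__ == "__main__":
    # Latin "Ko" followed by Cyrillic letters, as in the question's list.
    sample = "Ko\u043d\u0438\u043a\u0430"
    print(is_mixed(sample))   # the word mixes two scripts
    print(fix_text(sample))   # the same word with Cyrillic letters only
    print(fix_text("3M"))     # pure Latin word, left unchanged
```

After cleaning, the values can be pasted back into the sheet and sorted there. Sorting inside Python with `sorted()` would order by code point, which is not necessarily the alphabetical order expected for Mongolian (letters such as Ө and Ү sit outside the basic Cyrillic range).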
answered Dec 11 at 8:09
p._phidot_