What is the “sep=” metadata you can add to CSVs?
up vote
20
down vote
favorite
While trying to find out how to get CSVs to open correctly in Excel on a Spanish PC (belonging to a customer) I found many answers recommending using sep=,
at the top of the file:
- Easiest way to open CSV with commas in Excel
- How to control CSV import into Excel 2010
- How to open semicolon delimited CSV-files in US-version of Excel
- Space or tabs as separator in CSV
- How to get Excel to interpret the comma as a default delimiter in CSV files?
On the face of it, this approach works but none of those answers give any further information on where this metadata option comes from. I have tried to search for what it means (and hence found all of those answers) but have been unable to get any further information especially given that punctuation is notoriously hard to search for.
My primary concern is whether or not this is an Excel-specific feature. I suspect it is but have found nothing definitive to confirm it.
Some related questions are:
- What characters can be used in this setting?
- What other settings are available (eg. line termination character, quote character, etc).
- Are there any other tools that officially support this feature?
I'm hoping there is a piece of documentation somewhere that someone can point me to that will answer all these questions and more; I just haven't been able to find it.
Some clarification:
The sep=
is not a parameter to a parser. It is meant to be placed inside the CSV. Example:
sep=|
"LETTER"|"ANIMAL"
"a"|"aardvark"
"b"|"bear"
"c"|"cow"
microsoft-excel csv metadata
add a comment |
up vote
20
down vote
favorite
While trying to find out how to get CSVs to open correctly in Excel on a Spanish PC (belonging to a customer) I found many answers recommending using sep=,
at the top of the file:
- Easiest way to open CSV with commas in Excel
- How to control CSV import into Excel 2010
- How to open semicolon delimited CSV-files in US-version of Excel
- Space or tabs as separator in CSV
- How to get Excel to interpret the comma as a default delimiter in CSV files?
On the face of it, this approach works but none of those answers give any further information on where this metadata option comes from. I have tried to search for what it means (and hence found all of those answers) but have been unable to get any further information especially given that punctuation is notoriously hard to search for.
My primary concern is whether or not this is an Excel-specific feature. I suspect it is but have found nothing definitive to confirm it.
Some related questions are:
- What characters can be used in this setting?
- What other settings are available (eg. line termination character, quote character, etc).
- Are there any other tools that officially support this feature?
I'm hoping there is a piece of documentation somewhere that someone can point me to that will answer all these questions and more; I just haven't been able to find it.
Some clarification:
The sep=
is not a parameter to a parser. It is meant to be placed inside the CSV. Example:
sep=|
"LETTER"|"ANIMAL"
"a"|"aardvark"
"b"|"bear"
"c"|"cow"
microsoft-excel csv metadata
4
Yes, this is excel specific.
– Raystafarian
Feb 21 '15 at 13:09
2
Were you able to get anything on this? I am stumped with the same question and I am yet to find any MS specs that describes this feature. If this feature is undocumented by MS, how the heck did it get out in the first place?
– toddlermenot
Dec 24 '15 at 14:56
@toddlermenot Sadly not. All I get is people making the assertions that it is Excel specific with nothing to back up that assertion, which is disappointing.
– Burhan Ali
Dec 24 '15 at 14:59
add a comment |
up vote
20
down vote
favorite
up vote
20
down vote
favorite
While trying to find out how to get CSVs to open correctly in Excel on a Spanish PC (belonging to a customer) I found many answers recommending using sep=,
at the top of the file:
- Easiest way to open CSV with commas in Excel
- How to control CSV import into Excel 2010
- How to open semicolon delimited CSV-files in US-version of Excel
- Space or tabs as separator in CSV
- How to get Excel to interpret the comma as a default delimiter in CSV files?
On the face of it, this approach works but none of those answers give any further information on where this metadata option comes from. I have tried to search for what it means (and hence found all of those answers) but have been unable to get any further information especially given that punctuation is notoriously hard to search for.
My primary concern is whether or not this is an Excel-specific feature. I suspect it is but have found nothing definitive to confirm it.
Some related questions are:
- What characters can be used in this setting?
- What other settings are available (eg. line termination character, quote character, etc).
- Are there any other tools that officially support this feature?
I'm hoping there is a piece of documentation somewhere that someone can point me to that will answer all these questions and more; I just haven't been able to find it.
Some clarification:
The sep=
is not a parameter to a parser. It is meant to be placed inside the CSV. Example:
sep=|
"LETTER"|"ANIMAL"
"a"|"aardvark"
"b"|"bear"
"c"|"cow"
microsoft-excel csv metadata
While trying to find out how to get CSVs to open correctly in Excel on a Spanish PC (belonging to a customer) I found many answers recommending using sep=,
at the top of the file:
- Easiest way to open CSV with commas in Excel
- How to control CSV import into Excel 2010
- How to open semicolon delimited CSV-files in US-version of Excel
- Space or tabs as separator in CSV
- How to get Excel to interpret the comma as a default delimiter in CSV files?
On the face of it, this approach works but none of those answers give any further information on where this metadata option comes from. I have tried to search for what it means (and hence found all of those answers) but have been unable to get any further information especially given that punctuation is notoriously hard to search for.
My primary concern is whether or not this is an Excel-specific feature. I suspect it is but have found nothing definitive to confirm it.
Some related questions are:
- What characters can be used in this setting?
- What other settings are available (eg. line termination character, quote character, etc).
- Are there any other tools that officially support this feature?
I'm hoping there is a piece of documentation somewhere that someone can point me to that will answer all these questions and more; I just haven't been able to find it.
Some clarification:
The sep=
is not a parameter to a parser. It is meant to be placed inside the CSV. Example:
sep=|
"LETTER"|"ANIMAL"
"a"|"aardvark"
"b"|"bear"
"c"|"cow"
microsoft-excel csv metadata
microsoft-excel csv metadata
edited Mar 20 '17 at 10:17
Community♦
1
1
asked Jun 26 '14 at 9:14
Burhan Ali
3101415
3101415
4
Yes, this is excel specific.
– Raystafarian
Feb 21 '15 at 13:09
2
Were you able to get anything on this? I am stumped with the same question and I am yet to find any MS specs that describes this feature. If this feature is undocumented by MS, how the heck did it get out in the first place?
– toddlermenot
Dec 24 '15 at 14:56
@toddlermenot Sadly not. All I get is people making the assertions that it is Excel specific with nothing to back up that assertion, which is disappointing.
– Burhan Ali
Dec 24 '15 at 14:59
add a comment |
4
Yes, this is excel specific.
– Raystafarian
Feb 21 '15 at 13:09
2
Were you able to get anything on this? I am stumped with the same question and I am yet to find any MS specs that describes this feature. If this feature is undocumented by MS, how the heck did it get out in the first place?
– toddlermenot
Dec 24 '15 at 14:56
@toddlermenot Sadly not. All I get is people making the assertions that it is Excel specific with nothing to back up that assertion, which is disappointing.
– Burhan Ali
Dec 24 '15 at 14:59
4
4
Yes, this is excel specific.
– Raystafarian
Feb 21 '15 at 13:09
Yes, this is excel specific.
– Raystafarian
Feb 21 '15 at 13:09
2
2
Were you able to get anything on this? I am stumped with the same question and I am yet to find any MS specs that describes this feature. If this feature is undocumented by MS, how the heck did it get out in the first place?
– toddlermenot
Dec 24 '15 at 14:56
Were you able to get anything on this? I am stumped with the same question and I am yet to find any MS specs that describes this feature. If this feature is undocumented by MS, how the heck did it get out in the first place?
– toddlermenot
Dec 24 '15 at 14:56
@toddlermenot Sadly not. All I get is people making the assertions that it is Excel specific with nothing to back up that assertion, which is disappointing.
– Burhan Ali
Dec 24 '15 at 14:59
@toddlermenot Sadly not. All I get is people making the assertions that it is Excel specific with nothing to back up that assertion, which is disappointing.
– Burhan Ali
Dec 24 '15 at 14:59
add a comment |
4 Answers
4
active
oldest
votes
up vote
5
down vote
RFC 4180 is commonly recognized as the standard for the CSV format, and does not mention any such feature.
W3C Model for Tabular Data and Metadata on the Web mentions it as a feature not within their scope:
Many "CSV" files embed metadata, for example in lines before the header row
of the CSV document. This specification does not define any formats for
embedding metadata within CSV files, aside from the names of columns in the
header row.
So while I did not find any direct proof that this feature is Excel-specific, this apparently still is the case as there seems to be no standardization document covering that feature.
Good sourcing. One thing I should say is that sep=| defines the seperator for use in Excel as the pipe character ('|'). It is not widely used but is not exclusive to Excel. It is a seperator directive. International versions of Excel iirc have different characters, for instance, European language on a machine running excel uses semicolons.
– var firstName
Jun 30 '17 at 15:24
1
I am aware of the use (maybe that was not entirely clear from my answer). I was just pointing out that it seems that Excel introduced that feature, with it being based on any kind of (public) standard.
– Stefan Kögl
Jun 30 '17 at 16:33
add a comment |
up vote
0
down vote
To contribute towards an answer to one of your questions:
"Are there any other tools that officially support this feature?"
This is not supported by Apple Numbers (I've tested using Numbers version 3.6.2).
It's also not supported by csv2json (https://www.npmjs.com/package/csv2json) nor csvtojson (https://www.npmjs.com/package/csvtojson).
add a comment |
up vote
0
down vote
The instruction sep= is used in all the countries where comma is used as decimal delimiter, as Italy. Since Excel export CSV using semicolons instead of commas in those countries, if you want to be sure that the file can be read in USA or other countries, you NEED
sep=;
on top of file. This is Excel-specific and it might be ignored by other applications.
add a comment |
up vote
-3
down vote
Just about any tool that can import from .csv files recognises that comma is not the only possible separator, tabs have been common for a long time for example. I think this goes back to the days of punch cards even, that there needs to be some character that is recognised as end of data field - with the exception of systems that use fixed width fields or length prefixed fields just about every system has need this since the advent of computers.
The 3 essentials are a field separator, a record separator and an end of data marker, csv by default uses comma, newline and end of file for these respectively but just about any valid input can be used, you do, however, need a mechanism for escaping the special markers if they occur within your valid data.
One historic system I came across used newline for end of field, 2 blank lines for end of record and *!*!* END *!*!*
for the end of the data. I have commonly come across all of ,t|:
used as a field separator along with various control characters.
One thing to be wary of is that the French and others use comma as a decimal point - this can cause all sorts of fun and games. There is a degree of standardisation for csv files in rfc4180 but there are times when you need to manually edit files when exporting from one program and importing to another.
To answer the second part of your question Python, for example, includes a csv reader/writer as one of the standard libraries that offers the following options:
delimiter
- This is the field separator, (unless escaped).
doublequote
- If true then a"
within a field will be represented as""
escapechar
- removes any special meaning from any following character, e.g. if it is set tothen quote can be represented as
"
lineterminator
- usually one ofn
,r
,rn
ornr
quotechar
- character used to quote strings, usually"
or'
quoting
- flag as one of:
ALL
- Every field is in quotes, i.e.:"1","Free Beer","ASAP"
MINIMAL
- Only quote fields that have special characters, i.e.:1,"Beer, (Free)", Now, Please
QUOTE_NONNUMERIC
- Quote anything that is not a number, i.e.:1,"Beer, (Free)", "Now", "Please"
NONE
- Don't quote escape instead, i.e.:1,Beer, (Free), Now, Please
skipinitialspace
- skip any leading space of each field so you can use,
rather than just,
as a field separator.
Generally the more flexible and well thought out a tool that can import csv files is the more of these options, (possibly with differing names and/or mechanisms), that it will have.
2
This doesn't answer my question. It explains what CSVs are, which I am already aware of. My question is about a specific piece of syntax. Is there any background that I could add to my question to clarify things?
– Burhan Ali
Jun 26 '14 at 9:53
1
Thanks but that still doesn't help.sep=
is not a parameter to a parser. I have added some clarification to my question.
– Burhan Ali
Jun 26 '14 at 10:17
1
@SteveBarnes But does any tool, for example the Python library you mention, allow options as part of the content of the file? That is, I think, the main point of the question.
– Randy Orrison
Jun 26 '14 at 10:19
1
"Because they one of the many excel oddities." Is it though? That's part of what I am trying to work out. I don't want to know what a CSV is. I don't want to know how to parse a CSV. I want to know whatsep=
is and where it comes from.
– Burhan Ali
Jun 26 '14 at 13:39
1
sep= is a Microsoft Excel Specific hint method that is not covered in any Non-Microsoft specification that I can find. It comes from a part of Microsoft.
– Steve Barnes
Jun 26 '14 at 18:49
|
show 4 more comments
4 Answers
4
active
oldest
votes
4 Answers
4
active
oldest
votes
active
oldest
votes
active
oldest
votes
up vote
5
down vote
RFC 4180 is commonly recognized as the standard for the CSV format, and does not mention any such feature.
W3C Model for Tabular Data and Metadata on the Web mentions it as a feature not within their scope:
Many "CSV" files embed metadata, for example in lines before the header row
of the CSV document. This specification does not define any formats for
embedding metadata within CSV files, aside from the names of columns in the
header row.
So while I did not find any direct proof that this feature is Excel-specific, this apparently still is the case as there seems to be no standardization document covering that feature.
Good sourcing. One thing I should say is that sep=| defines the seperator for use in Excel as the pipe character ('|'). It is not widely used but is not exclusive to Excel. It is a seperator directive. International versions of Excel iirc have different characters, for instance, European language on a machine running excel uses semicolons.
– var firstName
Jun 30 '17 at 15:24
1
I am aware of the use (maybe that was not entirely clear from my answer). I was just pointing out that it seems that Excel introduced that feature, with it being based on any kind of (public) standard.
– Stefan Kögl
Jun 30 '17 at 16:33
add a comment |
up vote
5
down vote
RFC 4180 is commonly recognized as the standard for the CSV format, and does not mention any such feature.
W3C Model for Tabular Data and Metadata on the Web mentions it as a feature not within their scope:
Many "CSV" files embed metadata, for example in lines before the header row
of the CSV document. This specification does not define any formats for
embedding metadata within CSV files, aside from the names of columns in the
header row.
So while I did not find any direct proof that this feature is Excel-specific, this apparently still is the case as there seems to be no standardization document covering that feature.
Good sourcing. One thing I should say is that sep=| defines the seperator for use in Excel as the pipe character ('|'). It is not widely used but is not exclusive to Excel. It is a seperator directive. International versions of Excel iirc have different characters, for instance, European language on a machine running excel uses semicolons.
– var firstName
Jun 30 '17 at 15:24
1
I am aware of the use (maybe that was not entirely clear from my answer). I was just pointing out that it seems that Excel introduced that feature, with it being based on any kind of (public) standard.
– Stefan Kögl
Jun 30 '17 at 16:33
add a comment |
up vote
5
down vote
up vote
5
down vote
RFC 4180 is commonly recognized as the standard for the CSV format, and does not mention any such feature.
W3C Model for Tabular Data and Metadata on the Web mentions it as a feature not within their scope:
Many "CSV" files embed metadata, for example in lines before the header row
of the CSV document. This specification does not define any formats for
embedding metadata within CSV files, aside from the names of columns in the
header row.
So while I did not find any direct proof that this feature is Excel-specific, this apparently still is the case as there seems to be no standardization document covering that feature.
RFC 4180 is commonly recognized as the standard for the CSV format, and does not mention any such feature.
W3C Model for Tabular Data and Metadata on the Web mentions it as a feature not within their scope:
Many "CSV" files embed metadata, for example in lines before the header row
of the CSV document. This specification does not define any formats for
embedding metadata within CSV files, aside from the names of columns in the
header row.
So while I did not find any direct proof that this feature is Excel-specific, this apparently still is the case as there seems to be no standardization document covering that feature.
answered Jun 30 '17 at 14:24
Stefan Kögl
15025
15025
Good sourcing. One thing I should say is that sep=| defines the seperator for use in Excel as the pipe character ('|'). It is not widely used but is not exclusive to Excel. It is a seperator directive. International versions of Excel iirc have different characters, for instance, European language on a machine running excel uses semicolons.
– var firstName
Jun 30 '17 at 15:24
1
I am aware of the use (maybe that was not entirely clear from my answer). I was just pointing out that it seems that Excel introduced that feature, with it being based on any kind of (public) standard.
– Stefan Kögl
Jun 30 '17 at 16:33
add a comment |
Good sourcing. One thing I should say is that sep=| defines the seperator for use in Excel as the pipe character ('|'). It is not widely used but is not exclusive to Excel. It is a seperator directive. International versions of Excel iirc have different characters, for instance, European language on a machine running excel uses semicolons.
– var firstName
Jun 30 '17 at 15:24
1
I am aware of the use (maybe that was not entirely clear from my answer). I was just pointing out that it seems that Excel introduced that feature, with it being based on any kind of (public) standard.
– Stefan Kögl
Jun 30 '17 at 16:33
Good sourcing. One thing I should say is that sep=| defines the seperator for use in Excel as the pipe character ('|'). It is not widely used but is not exclusive to Excel. It is a seperator directive. International versions of Excel iirc have different characters, for instance, European language on a machine running excel uses semicolons.
– var firstName
Jun 30 '17 at 15:24
Good sourcing. One thing I should say is that sep=| defines the seperator for use in Excel as the pipe character ('|'). It is not widely used but is not exclusive to Excel. It is a seperator directive. International versions of Excel iirc have different characters, for instance, European language on a machine running excel uses semicolons.
– var firstName
Jun 30 '17 at 15:24
1
1
I am aware of the use (maybe that was not entirely clear from my answer). I was just pointing out that it seems that Excel introduced that feature, with it being based on any kind of (public) standard.
– Stefan Kögl
Jun 30 '17 at 16:33
I am aware of the use (maybe that was not entirely clear from my answer). I was just pointing out that it seems that Excel introduced that feature, with it being based on any kind of (public) standard.
– Stefan Kögl
Jun 30 '17 at 16:33
add a comment |
up vote
0
down vote
To contribute towards an answer to one of your questions:
"Are there any other tools that officially support this feature?"
This is not supported by Apple Numbers (I've tested using Numbers version 3.6.2).
It's also not supported by csv2json (https://www.npmjs.com/package/csv2json) nor csvtojson (https://www.npmjs.com/package/csvtojson).
add a comment |
up vote
0
down vote
To contribute towards an answer to one of your questions:
"Are there any other tools that officially support this feature?"
This is not supported by Apple Numbers (I've tested using Numbers version 3.6.2).
It's also not supported by csv2json (https://www.npmjs.com/package/csv2json) nor csvtojson (https://www.npmjs.com/package/csvtojson).
add a comment |
up vote
0
down vote
up vote
0
down vote
To contribute towards an answer to one of your questions:
"Are there any other tools that officially support this feature?"
This is not supported by Apple Numbers (I've tested using Numbers version 3.6.2).
It's also not supported by csv2json (https://www.npmjs.com/package/csv2json) nor csvtojson (https://www.npmjs.com/package/csvtojson).
To contribute towards an answer to one of your questions:
"Are there any other tools that officially support this feature?"
This is not supported by Apple Numbers (I've tested using Numbers version 3.6.2).
It's also not supported by csv2json (https://www.npmjs.com/package/csv2json) nor csvtojson (https://www.npmjs.com/package/csvtojson).
answered Oct 27 '16 at 15:23
kintel
121
121
add a comment |
add a comment |
up vote
0
down vote
The instruction sep= is used in all the countries where comma is used as decimal delimiter, as Italy. Since Excel export CSV using semicolons instead of commas in those countries, if you want to be sure that the file can be read in USA or other countries, you NEED
sep=;
on top of file. This is Excel-specific and it might be ignored by other applications.
add a comment |
up vote
0
down vote
The instruction sep= is used in all the countries where comma is used as decimal delimiter, as Italy. Since Excel export CSV using semicolons instead of commas in those countries, if you want to be sure that the file can be read in USA or other countries, you NEED
sep=;
on top of file. This is Excel-specific and it might be ignored by other applications.
add a comment |
up vote
0
down vote
up vote
0
down vote
The instruction sep= is used in all the countries where comma is used as decimal delimiter, as Italy. Since Excel export CSV using semicolons instead of commas in those countries, if you want to be sure that the file can be read in USA or other countries, you NEED
sep=;
on top of file. This is Excel-specific and it might be ignored by other applications.
The instruction sep= is used in all the countries where comma is used as decimal delimiter, as Italy. Since Excel export CSV using semicolons instead of commas in those countries, if you want to be sure that the file can be read in USA or other countries, you NEED
sep=;
on top of file. This is Excel-specific and it might be ignored by other applications.
answered 8 hours ago
Dario de Judicibus
11
11
add a comment |
add a comment |
up vote
-3
down vote
Just about any tool that can import from .csv files recognises that comma is not the only possible separator, tabs have been common for a long time for example. I think this goes back to the days of punch cards even, that there needs to be some character that is recognised as end of data field - with the exception of systems that use fixed width fields or length prefixed fields just about every system has need this since the advent of computers.
The 3 essentials are a field separator, a record separator and an end of data marker, csv by default uses comma, newline and end of file for these respectively but just about any valid input can be used, you do, however, need a mechanism for escaping the special markers if they occur within your valid data.
One historic system I came across used newline for end of field, 2 blank lines for end of record and *!*!* END *!*!*
for the end of the data. I have commonly come across all of ,t|:
used as a field separator along with various control characters.
One thing to be wary of is that the French and others use comma as a decimal point - this can cause all sorts of fun and games. There is a degree of standardisation for csv files in rfc4180 but there are times when you need to manually edit files when exporting from one program and importing to another.
To answer the second part of your question Python, for example, includes a csv reader/writer as one of the standard libraries that offers the following options:
delimiter
- This is the field separator, (unless escaped).
doublequote
- If true then a"
within a field will be represented as""
escapechar
- removes any special meaning from any following character, e.g. if it is set tothen quote can be represented as
"
lineterminator
- usually one ofn
,r
,rn
ornr
quotechar
- character used to quote strings, usually"
or'
quoting
- flag as one of:
ALL
- Every field is in quotes, i.e.:"1","Free Beer","ASAP"
MINIMAL
- Only quote fields that have special characters, i.e.:1,"Beer, (Free)", Now, Please
QUOTE_NONNUMERIC
- Quote anything that is not a number, i.e.:1,"Beer, (Free)", "Now", "Please"
NONE
- Don't quote escape instead, i.e.:1,Beer, (Free), Now, Please
skipinitialspace
- skip any leading space of each field so you can use,
rather than just,
as a field separator.
Generally the more flexible and well thought out a tool that can import csv files is the more of these options, (possibly with differing names and/or mechanisms), that it will have.
2
This doesn't answer my question. It explains what CSVs are, which I am already aware of. My question is about a specific piece of syntax. Is there any background that I could add to my question to clarify things?
– Burhan Ali
Jun 26 '14 at 9:53
1
Thanks but that still doesn't help.sep=
is not a parameter to a parser. I have added some clarification to my question.
– Burhan Ali
Jun 26 '14 at 10:17
1
@SteveBarnes But does any tool, for example the Python library you mention, allow options as part of the content of the file? That is, I think, the main point of the question.
– Randy Orrison
Jun 26 '14 at 10:19
1
"Because they one of the many excel oddities." Is it though? That's part of what I am trying to work out. I don't want to know what a CSV is. I don't want to know how to parse a CSV. I want to know whatsep=
is and where it comes from.
– Burhan Ali
Jun 26 '14 at 13:39
1
sep= is a Microsoft Excel Specific hint method that is not covered in any Non-Microsoft specification that I can find. It comes from a part of Microsoft.
– Steve Barnes
Jun 26 '14 at 18:49
|
show 4 more comments
up vote
-3
down vote
Just about any tool that can import from .csv files recognises that comma is not the only possible separator, tabs have been common for a long time for example. I think this goes back to the days of punch cards even, that there needs to be some character that is recognised as end of data field - with the exception of systems that use fixed width fields or length prefixed fields just about every system has need this since the advent of computers.
The 3 essentials are a field separator, a record separator and an end of data marker, csv by default uses comma, newline and end of file for these respectively but just about any valid input can be used, you do, however, need a mechanism for escaping the special markers if they occur within your valid data.
One historic system I came across used newline for end of field, 2 blank lines for end of record and *!*!* END *!*!*
for the end of the data. I have commonly come across all of ,t|:
used as a field separator along with various control characters.
One thing to be wary of is that the French and others use comma as a decimal point - this can cause all sorts of fun and games. There is a degree of standardisation for csv files in rfc4180 but there are times when you need to manually edit files when exporting from one program and importing to another.
To answer the second part of your question Python, for example, includes a csv reader/writer as one of the standard libraries that offers the following options:
delimiter
- This is the field separator, (unless escaped).
doublequote
- If true then a"
within a field will be represented as""
escapechar
- removes any special meaning from any following character, e.g. if it is set tothen quote can be represented as
"
lineterminator
- usually one ofn
,r
,rn
ornr
quotechar
- character used to quote strings, usually"
or'
quoting
- flag as one of:
ALL
- Every field is in quotes, i.e.:"1","Free Beer","ASAP"
MINIMAL
- Only quote fields that have special characters, i.e.:1,"Beer, (Free)", Now, Please
QUOTE_NONNUMERIC
- Quote anything that is not a number, i.e.:1,"Beer, (Free)", "Now", "Please"
NONE
- Don't quote escape instead, i.e.:1,Beer, (Free), Now, Please
skipinitialspace
- skip any leading space of each field so you can use,
rather than just,
as a field separator.
Generally the more flexible and well thought out a tool that can import csv files is the more of these options, (possibly with differing names and/or mechanisms), that it will have.
2
This doesn't answer my question. It explains what CSVs are, which I am already aware of. My question is about a specific piece of syntax. Is there any background that I could add to my question to clarify things?
– Burhan Ali
Jun 26 '14 at 9:53
1
Thanks but that still doesn't help.sep=
is not a parameter to a parser. I have added some clarification to my question.
– Burhan Ali
Jun 26 '14 at 10:17
1
@SteveBarnes But does any tool, for example the Python library you mention, allow options as part of the content of the file? That is, I think, the main point of the question.
– Randy Orrison
Jun 26 '14 at 10:19
1
"Because they one of the many excel oddities." Is it though? That's part of what I am trying to work out. I don't want to know what a CSV is. I don't want to know how to parse a CSV. I want to know whatsep=
is and where it comes from.
– Burhan Ali
Jun 26 '14 at 13:39
1
sep= is a Microsoft Excel Specific hint method that is not covered in any Non-Microsoft specification that I can find. It comes from a part of Microsoft.
– Steve Barnes
Jun 26 '14 at 18:49
|
show 4 more comments
up vote
-3
down vote
up vote
-3
down vote
Just about any tool that can import from .csv files recognises that comma is not the only possible separator, tabs have been common for a long time for example. I think this goes back to the days of punch cards even, that there needs to be some character that is recognised as end of data field - with the exception of systems that use fixed width fields or length prefixed fields just about every system has need this since the advent of computers.
The 3 essentials are a field separator, a record separator and an end of data marker, csv by default uses comma, newline and end of file for these respectively but just about any valid input can be used, you do, however, need a mechanism for escaping the special markers if they occur within your valid data.
One historic system I came across used newline for end of field, 2 blank lines for end of record and *!*!* END *!*!*
for the end of the data. I have commonly come across all of ,t|:
used as a field separator along with various control characters.
One thing to be wary of is that the French and others use comma as a decimal point - this can cause all sorts of fun and games. There is a degree of standardisation for csv files in rfc4180 but there are times when you need to manually edit files when exporting from one program and importing to another.
To answer the second part of your question Python, for example, includes a csv reader/writer as one of the standard libraries that offers the following options:
delimiter
- This is the field separator, (unless escaped).
doublequote
- If true then a"
within a field will be represented as""
escapechar
- removes any special meaning from any following character, e.g. if it is set tothen quote can be represented as
"
lineterminator
- usually one ofn
,r
,rn
ornr
quotechar
- character used to quote strings, usually"
or'
quoting
- flag as one of:
ALL
- Every field is in quotes, i.e.:"1","Free Beer","ASAP"
MINIMAL
- Only quote fields that have special characters, i.e.:1,"Beer, (Free)", Now, Please
QUOTE_NONNUMERIC
- Quote anything that is not a number, i.e.:1,"Beer, (Free)", "Now", "Please"
NONE
- Don't quote escape instead, i.e.:1,Beer, (Free), Now, Please
skipinitialspace
- skip any leading space of each field so you can use,
rather than just,
as a field separator.
Generally the more flexible and well thought out a tool that can import csv files is the more of these options, (possibly with differing names and/or mechanisms), that it will have.
Just about any tool that can import from .csv files recognises that comma is not the only possible separator, tabs have been common for a long time for example. I think this goes back to the days of punch cards even, that there needs to be some character that is recognised as end of data field - with the exception of systems that use fixed width fields or length prefixed fields just about every system has need this since the advent of computers.
The 3 essentials are a field separator, a record separator and an end of data marker, csv by default uses comma, newline and end of file for these respectively but just about any valid input can be used, you do, however, need a mechanism for escaping the special markers if they occur within your valid data.
One historic system I came across used newline for end of field, 2 blank lines for end of record and *!*!* END *!*!*
for the end of the data. I have commonly come across all of ,t|:
used as a field separator along with various control characters.
One thing to be wary of is that the French and others use comma as a decimal point - this can cause all sorts of fun and games. There is a degree of standardisation for csv files in rfc4180 but there are times when you need to manually edit files when exporting from one program and importing to another.
To answer the second part of your question Python, for example, includes a csv reader/writer as one of the standard libraries that offers the following options:
delimiter
- This is the field separator, (unless escaped).
doublequote
- If true then a"
within a field will be represented as""
escapechar
- removes any special meaning from any following character, e.g. if it is set tothen quote can be represented as
"
lineterminator
- usually one ofn
,r
,rn
ornr
quotechar
- character used to quote strings, usually"
or'
quoting
- flag as one of:
ALL
- Every field is in quotes, i.e.:"1","Free Beer","ASAP"
MINIMAL
- Only quote fields that have special characters, i.e.:1,"Beer, (Free)", Now, Please
QUOTE_NONNUMERIC
- Quote anything that is not a number, i.e.:1,"Beer, (Free)", "Now", "Please"
NONE
- Don't quote escape instead, i.e.:1,Beer, (Free), Now, Please
skipinitialspace
- skip any leading space of each field so you can use,
rather than just,
as a field separator.
Generally the more flexible and well thought out a tool that can import csv files is the more of these options, (possibly with differing names and/or mechanisms), that it will have.
edited Jun 26 '14 at 10:08
answered Jun 26 '14 at 9:29
Steve Barnes
19816
19816
2
This doesn't answer my question. It explains what CSVs are, which I am already aware of. My question is about a specific piece of syntax. Is there any background that I could add to my question to clarify things?
– Burhan Ali
Jun 26 '14 at 9:53
1
Thanks but that still doesn't help.sep=
is not a parameter to a parser. I have added some clarification to my question.
– Burhan Ali
Jun 26 '14 at 10:17
1
@SteveBarnes But does any tool, for example the Python library you mention, allow options as part of the content of the file? That is, I think, the main point of the question.
– Randy Orrison
Jun 26 '14 at 10:19
1
"Because they one of the many excel oddities." Is it though? That's part of what I am trying to work out. I don't want to know what a CSV is. I don't want to know how to parse a CSV. I want to know whatsep=
is and where it comes from.
– Burhan Ali
Jun 26 '14 at 13:39
1
sep= is a Microsoft Excel Specific hint method that is not covered in any Non-Microsoft specification that I can find. It comes from a part of Microsoft.
– Steve Barnes
Jun 26 '14 at 18:49
|
show 4 more comments
2
This doesn't answer my question. It explains what CSVs are, which I am already aware of. My question is about a specific piece of syntax. Is there any background that I could add to my question to clarify things?
– Burhan Ali
Jun 26 '14 at 9:53
1
Thanks but that still doesn't help.sep=
is not a parameter to a parser. I have added some clarification to my question.
– Burhan Ali
Jun 26 '14 at 10:17
1
@SteveBarnes But does any tool, for example the Python library you mention, allow options as part of the content of the file? That is, I think, the main point of the question.
– Randy Orrison
Jun 26 '14 at 10:19
1
"Because they one of the many excel oddities." Is it though? That's part of what I am trying to work out. I don't want to know what a CSV is. I don't want to know how to parse a CSV. I want to know whatsep=
is and where it comes from.
– Burhan Ali
Jun 26 '14 at 13:39
1
sep= is a Microsoft Excel Specific hint method that is not covered in any Non-Microsoft specification that I can find. It comes from a part of Microsoft.
– Steve Barnes
Jun 26 '14 at 18:49
2
2
This doesn't answer my question. It explains what CSVs are, which I am already aware of. My question is about a specific piece of syntax. Is there any background that I could add to my question to clarify things?
– Burhan Ali
Jun 26 '14 at 9:53
This doesn't answer my question. It explains what CSVs are, which I am already aware of. My question is about a specific piece of syntax. Is there any background that I could add to my question to clarify things?
– Burhan Ali
Jun 26 '14 at 9:53
1
1
Thanks but that still doesn't help.
sep=
is not a parameter to a parser. I have added some clarification to my question.– Burhan Ali
Jun 26 '14 at 10:17
Thanks but that still doesn't help.
sep=
is not a parameter to a parser. I have added some clarification to my question.– Burhan Ali
Jun 26 '14 at 10:17
1
1
@SteveBarnes But does any tool, for example the Python library you mention, allow options as part of the content of the file? That is, I think, the main point of the question.
– Randy Orrison
Jun 26 '14 at 10:19
@SteveBarnes But does any tool, for example the Python library you mention, allow options as part of the content of the file? That is, I think, the main point of the question.
– Randy Orrison
Jun 26 '14 at 10:19
1
1
"Because they one of the many excel oddities." Is it though? That's part of what I am trying to work out. I don't want to know what a CSV is. I don't want to know how to parse a CSV. I want to know what
sep=
is and where it comes from.– Burhan Ali
Jun 26 '14 at 13:39
"Because they one of the many excel oddities." Is it though? That's part of what I am trying to work out. I don't want to know what a CSV is. I don't want to know how to parse a CSV. I want to know what
sep=
is and where it comes from.– Burhan Ali
Jun 26 '14 at 13:39
1
1
sep= is a Microsoft Excel Specific hint method that is not covered in any Non-Microsoft specification that I can find. It comes from a part of Microsoft.
– Steve Barnes
Jun 26 '14 at 18:49
sep= is a Microsoft Excel Specific hint method that is not covered in any Non-Microsoft specification that I can find. It comes from a part of Microsoft.
– Steve Barnes
Jun 26 '14 at 18:49
|
show 4 more comments
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fsuperuser.com%2fquestions%2f773644%2fwhat-is-the-sep-metadata-you-can-add-to-csvs%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
4
Yes, this is excel specific.
– Raystafarian
Feb 21 '15 at 13:09
2
Were you able to get anything on this? I am stumped with the same question and I am yet to find any MS specs that describes this feature. If this feature is undocumented by MS, how the heck did it get out in the first place?
– toddlermenot
Dec 24 '15 at 14:56
@toddlermenot Sadly not. All I get is people making the assertions that it is Excel specific with nothing to back up that assertion, which is disappointing.
– Burhan Ali
Dec 24 '15 at 14:59