Can each page in a PDF contain its own metadata?
up vote
1
down vote
favorite
Is it possible to have the same field names, but with different metadata values for each page in a PDF?
For example, let's say each page could have the fields 'author', 'document reference', 'location', etc... and these have different values on each page. So this example would have on page 1 the author "John Smith" and page 2 would have the author "Jane Simmons" and so on.
The only examples I've seen for PDF metadata all relate to document-wide information - but none for page-only information.
I'm developing in Python.
Thank You. :)
pdf metadata
|
show 2 more comments
up vote
1
down vote
favorite
Is it possible to have the same field names, but with different metadata values for each page in a PDF?
For example, let's say each page could have the fields 'author', 'document reference', 'location', etc... and these have different values on each page. So this example would have on page 1 the author "John Smith" and page 2 would have the author "Jane Simmons" and so on.
The only examples I've seen for PDF metadata all relate to document-wide information - but none for page-only information.
I'm developing in Python.
Thank You. :)
pdf metadata
The easiest way to confirm is to go right to the source, i.e. the ISO 32000-1 Standards document.
– Karan
Jun 5 '15 at 9:08
@Karan, that is slightly difficult as the actual standard is very expensive - CHF200. Fortunately there are legitimate downloads.
– Julian Knight
Jun 5 '15 at 9:14
@JulianKnight: Meh, cost never occurred to me because I've always obtained it for free (including supplemental changes) from adobe.com/devnet/pdf/pdf_reference.html
– Karan
Jun 5 '15 at 9:42
@Karan - ah, well it was you who mentioned the ISO standard :) It always annoys me that you have to pay so much to get copies of ISO standards.
– Julian Knight
Jun 5 '15 at 10:02
@JulianKnight: I did, but the very first search result for "ISO 32000-1 Standards" is Adobe's PDF so... Re. the cost I'm completely with you. I see no reason why they need to charge so much, especially from individuals and not companies.
– Karan
Jun 5 '15 at 10:05
|
show 2 more comments
up vote
1
down vote
favorite
up vote
1
down vote
favorite
Is it possible to have the same field names, but with different metadata values for each page in a PDF?
For example, let's say each page could have the fields 'author', 'document reference', 'location', etc... and these have different values on each page. So this example would have on page 1 the author "John Smith" and page 2 would have the author "Jane Simmons" and so on.
The only examples I've seen for PDF metadata all relate to document-wide information - but none for page-only information.
I'm developing in Python.
Thank You. :)
pdf metadata
Is it possible to have the same field names, but with different metadata values for each page in a PDF?
For example, let's say each page could have the fields 'author', 'document reference', 'location', etc... and these have different values on each page. So this example would have on page 1 the author "John Smith" and page 2 would have the author "Jane Simmons" and so on.
The only examples I've seen for PDF metadata all relate to document-wide information - but none for page-only information.
I'm developing in Python.
Thank You. :)
pdf metadata
pdf metadata
edited Jun 6 '15 at 2:00
asked Jun 5 '15 at 8:56
teracow
85
85
The easiest way to confirm is to go right to the source, i.e. the ISO 32000-1 Standards document.
– Karan
Jun 5 '15 at 9:08
@Karan, that is slightly difficult as the actual standard is very expensive - CHF200. Fortunately there are legitimate downloads.
– Julian Knight
Jun 5 '15 at 9:14
@JulianKnight: Meh, cost never occurred to me because I've always obtained it for free (including supplemental changes) from adobe.com/devnet/pdf/pdf_reference.html
– Karan
Jun 5 '15 at 9:42
@Karan - ah, well it was you who mentioned the ISO standard :) It always annoys me that you have to pay so much to get copies of ISO standards.
– Julian Knight
Jun 5 '15 at 10:02
@JulianKnight: I did, but the very first search result for "ISO 32000-1 Standards" is Adobe's PDF so... Re. the cost I'm completely with you. I see no reason why they need to charge so much, especially from individuals and not companies.
– Karan
Jun 5 '15 at 10:05
|
show 2 more comments
The easiest way to confirm is to go right to the source, i.e. the ISO 32000-1 Standards document.
– Karan
Jun 5 '15 at 9:08
@Karan, that is slightly difficult as the actual standard is very expensive - CHF200. Fortunately there are legitimate downloads.
– Julian Knight
Jun 5 '15 at 9:14
@JulianKnight: Meh, cost never occurred to me because I've always obtained it for free (including supplemental changes) from adobe.com/devnet/pdf/pdf_reference.html
– Karan
Jun 5 '15 at 9:42
@Karan - ah, well it was you who mentioned the ISO standard :) It always annoys me that you have to pay so much to get copies of ISO standards.
– Julian Knight
Jun 5 '15 at 10:02
@JulianKnight: I did, but the very first search result for "ISO 32000-1 Standards" is Adobe's PDF so... Re. the cost I'm completely with you. I see no reason why they need to charge so much, especially from individuals and not companies.
– Karan
Jun 5 '15 at 10:05
The easiest way to confirm is to go right to the source, i.e. the ISO 32000-1 Standards document.
– Karan
Jun 5 '15 at 9:08
The easiest way to confirm is to go right to the source, i.e. the ISO 32000-1 Standards document.
– Karan
Jun 5 '15 at 9:08
@Karan, that is slightly difficult as the actual standard is very expensive - CHF200. Fortunately there are legitimate downloads.
– Julian Knight
Jun 5 '15 at 9:14
@Karan, that is slightly difficult as the actual standard is very expensive - CHF200. Fortunately there are legitimate downloads.
– Julian Knight
Jun 5 '15 at 9:14
@JulianKnight: Meh, cost never occurred to me because I've always obtained it for free (including supplemental changes) from adobe.com/devnet/pdf/pdf_reference.html
– Karan
Jun 5 '15 at 9:42
@JulianKnight: Meh, cost never occurred to me because I've always obtained it for free (including supplemental changes) from adobe.com/devnet/pdf/pdf_reference.html
– Karan
Jun 5 '15 at 9:42
@Karan - ah, well it was you who mentioned the ISO standard :) It always annoys me that you have to pay so much to get copies of ISO standards.
– Julian Knight
Jun 5 '15 at 10:02
@Karan - ah, well it was you who mentioned the ISO standard :) It always annoys me that you have to pay so much to get copies of ISO standards.
– Julian Knight
Jun 5 '15 at 10:02
@JulianKnight: I did, but the very first search result for "ISO 32000-1 Standards" is Adobe's PDF so... Re. the cost I'm completely with you. I see no reason why they need to charge so much, especially from individuals and not companies.
– Karan
Jun 5 '15 at 10:05
@JulianKnight: I did, but the very first search result for "ISO 32000-1 Standards" is Adobe's PDF so... Re. the cost I'm completely with you. I see no reason why they need to charge so much, especially from individuals and not companies.
– Karan
Jun 5 '15 at 10:05
|
show 2 more comments
2 Answers
2
active
oldest
votes
up vote
1
down vote
accepted
It looks as though the standard supports metadata at more than just the document level:
In general, any PDF stream or dictionary may have metadata attached to
it as long as the stream or dictionary represents an actual
information resource, as opposed to serving as an implementation
artifact. Some PDF constructs are considered implementational, and
hence may not have associated metadata.
Clear as mud! Thankfully there are some additional notes. Including:
In addition, metadata may also be associated with marked content
within a content stream. This association shall be created by
including an entry in the property list dictionary whose key shall be
Metadata and whose value shall be the metadata stream dictionary.
Because this construct refers to an object outside the content
stream, the property list is referred to indirectly as a named
resource (see 14.6.2, “Property Lists”).
This means that you can attach metadata to certain artifacts within your document but I don't believe that you can attach them to a specific page, you would have to have an object that you attached the data to - an image would be the obvious example though the standard seems to refer to shadings too.
Of course, although the standard seems to allow it, that doesn't mean that common PDF handling libraries and applications support it.
Adobe's downloadable version of the Standard (will save you CHF200)
I read this and as you said it's "clear as mud". Still no sure whether all the metadata fields the OP wants can be stored per page. Plus of course you're absolutely right, finding apps that properly conform to the specs is not easy.
– Karan
Jun 5 '15 at 9:43
Thanks @Karan, I've improved the wording to more explicitly answer the question.
– Julian Knight
Jun 5 '15 at 10:00
+1 Based on the official specs I think this is the best we can get, unless some PDF expert can prove otherwise (that per page metadata of the sort wanted by the OP is indeed possible).
– Karan
Jun 5 '15 at 10:15
@JulianKnight - yes, this might be what I'm looking for. Each page consists of a scanned image. I'm hoping to store some easily searchable reference info along with each image. Thanks. :)
– teracow
Jun 7 '15 at 2:53
add a comment |
up vote
0
down vote
PDF pages can have annotations; the most common type of them might be those that are like sticky notes but these are not the only ones. These are described in part 8.4 of the Adobe PDF 1.7 reference. You can create text annotations, name them with keys like "author" and set the contents to the corresponding string values. Then set the hidden flag to true so the annotation is not displayed or allowed to interact with the user. There's a requirement to set a rectangle for the annotation but since it's not going to be displayed any rectangle inside the page should work.
add a comment |
2 Answers
2
active
oldest
votes
2 Answers
2
active
oldest
votes
active
oldest
votes
active
oldest
votes
up vote
1
down vote
accepted
It looks as though the standard supports metadata at more than just the document level:
In general, any PDF stream or dictionary may have metadata attached to
it as long as the stream or dictionary represents an actual
information resource, as opposed to serving as an implementation
artifact. Some PDF constructs are considered implementational, and
hence may not have associated metadata.
Clear as mud! Thankfully there are some additional notes. Including:
In addition, metadata may also be associated with marked content
within a content stream. This association shall be created by
including an entry in the property list dictionary whose key shall be
Metadata and whose value shall be the metadata stream dictionary.
Because this construct refers to an object outside the content
stream, the property list is referred to indirectly as a named
resource (see 14.6.2, “Property Lists”).
This means that you can attach metadata to certain artifacts within your document but I don't believe that you can attach them to a specific page, you would have to have an object that you attached the data to - an image would be the obvious example though the standard seems to refer to shadings too.
Of course, although the standard seems to allow it, that doesn't mean that common PDF handling libraries and applications support it.
Adobe's downloadable version of the Standard (will save you CHF200)
I read this and as you said it's "clear as mud". Still no sure whether all the metadata fields the OP wants can be stored per page. Plus of course you're absolutely right, finding apps that properly conform to the specs is not easy.
– Karan
Jun 5 '15 at 9:43
Thanks @Karan, I've improved the wording to more explicitly answer the question.
– Julian Knight
Jun 5 '15 at 10:00
+1 Based on the official specs I think this is the best we can get, unless some PDF expert can prove otherwise (that per page metadata of the sort wanted by the OP is indeed possible).
– Karan
Jun 5 '15 at 10:15
@JulianKnight - yes, this might be what I'm looking for. Each page consists of a scanned image. I'm hoping to store some easily searchable reference info along with each image. Thanks. :)
– teracow
Jun 7 '15 at 2:53
add a comment |
up vote
1
down vote
accepted
It looks as though the standard supports metadata at more than just the document level:
In general, any PDF stream or dictionary may have metadata attached to
it as long as the stream or dictionary represents an actual
information resource, as opposed to serving as an implementation
artifact. Some PDF constructs are considered implementational, and
hence may not have associated metadata.
Clear as mud! Thankfully there are some additional notes. Including:
In addition, metadata may also be associated with marked content
within a content stream. This association shall be created by
including an entry in the property list dictionary whose key shall be
Metadata and whose value shall be the metadata stream dictionary.
Because this construct refers to an object outside the content
stream, the property list is referred to indirectly as a named
resource (see 14.6.2, “Property Lists”).
This means that you can attach metadata to certain artifacts within your document but I don't believe that you can attach them to a specific page, you would have to have an object that you attached the data to - an image would be the obvious example though the standard seems to refer to shadings too.
Of course, although the standard seems to allow it, that doesn't mean that common PDF handling libraries and applications support it.
Adobe's downloadable version of the Standard (will save you CHF200)
I read this and as you said it's "clear as mud". Still no sure whether all the metadata fields the OP wants can be stored per page. Plus of course you're absolutely right, finding apps that properly conform to the specs is not easy.
– Karan
Jun 5 '15 at 9:43
Thanks @Karan, I've improved the wording to more explicitly answer the question.
– Julian Knight
Jun 5 '15 at 10:00
+1 Based on the official specs I think this is the best we can get, unless some PDF expert can prove otherwise (that per page metadata of the sort wanted by the OP is indeed possible).
– Karan
Jun 5 '15 at 10:15
@JulianKnight - yes, this might be what I'm looking for. Each page consists of a scanned image. I'm hoping to store some easily searchable reference info along with each image. Thanks. :)
– teracow
Jun 7 '15 at 2:53
add a comment |
up vote
1
down vote
accepted
up vote
1
down vote
accepted
It looks as though the standard supports metadata at more than just the document level:
In general, any PDF stream or dictionary may have metadata attached to
it as long as the stream or dictionary represents an actual
information resource, as opposed to serving as an implementation
artifact. Some PDF constructs are considered implementational, and
hence may not have associated metadata.
Clear as mud! Thankfully there are some additional notes. Including:
In addition, metadata may also be associated with marked content
within a content stream. This association shall be created by
including an entry in the property list dictionary whose key shall be
Metadata and whose value shall be the metadata stream dictionary.
Because this construct refers to an object outside the content
stream, the property list is referred to indirectly as a named
resource (see 14.6.2, “Property Lists”).
This means that you can attach metadata to certain artifacts within your document but I don't believe that you can attach them to a specific page, you would have to have an object that you attached the data to - an image would be the obvious example though the standard seems to refer to shadings too.
Of course, although the standard seems to allow it, that doesn't mean that common PDF handling libraries and applications support it.
Adobe's downloadable version of the Standard (will save you CHF200)
It looks as though the standard supports metadata at more than just the document level:
In general, any PDF stream or dictionary may have metadata attached to
it as long as the stream or dictionary represents an actual
information resource, as opposed to serving as an implementation
artifact. Some PDF constructs are considered implementational, and
hence may not have associated metadata.
Clear as mud! Thankfully there are some additional notes. Including:
In addition, metadata may also be associated with marked content
within a content stream. This association shall be created by
including an entry in the property list dictionary whose key shall be
Metadata and whose value shall be the metadata stream dictionary.
Because this construct refers to an object outside the content
stream, the property list is referred to indirectly as a named
resource (see 14.6.2, “Property Lists”).
This means that you can attach metadata to certain artifacts within your document but I don't believe that you can attach them to a specific page, you would have to have an object that you attached the data to - an image would be the obvious example though the standard seems to refer to shadings too.
Of course, although the standard seems to allow it, that doesn't mean that common PDF handling libraries and applications support it.
Adobe's downloadable version of the Standard (will save you CHF200)
edited Jun 5 '15 at 9:59
answered Jun 5 '15 at 9:23
Julian Knight
12.9k11535
12.9k11535
I read this and as you said it's "clear as mud". Still no sure whether all the metadata fields the OP wants can be stored per page. Plus of course you're absolutely right, finding apps that properly conform to the specs is not easy.
– Karan
Jun 5 '15 at 9:43
Thanks @Karan, I've improved the wording to more explicitly answer the question.
– Julian Knight
Jun 5 '15 at 10:00
+1 Based on the official specs I think this is the best we can get, unless some PDF expert can prove otherwise (that per page metadata of the sort wanted by the OP is indeed possible).
– Karan
Jun 5 '15 at 10:15
@JulianKnight - yes, this might be what I'm looking for. Each page consists of a scanned image. I'm hoping to store some easily searchable reference info along with each image. Thanks. :)
– teracow
Jun 7 '15 at 2:53
add a comment |
I read this and as you said it's "clear as mud". Still no sure whether all the metadata fields the OP wants can be stored per page. Plus of course you're absolutely right, finding apps that properly conform to the specs is not easy.
– Karan
Jun 5 '15 at 9:43
Thanks @Karan, I've improved the wording to more explicitly answer the question.
– Julian Knight
Jun 5 '15 at 10:00
+1 Based on the official specs I think this is the best we can get, unless some PDF expert can prove otherwise (that per page metadata of the sort wanted by the OP is indeed possible).
– Karan
Jun 5 '15 at 10:15
@JulianKnight - yes, this might be what I'm looking for. Each page consists of a scanned image. I'm hoping to store some easily searchable reference info along with each image. Thanks. :)
– teracow
Jun 7 '15 at 2:53
I read this and as you said it's "clear as mud". Still no sure whether all the metadata fields the OP wants can be stored per page. Plus of course you're absolutely right, finding apps that properly conform to the specs is not easy.
– Karan
Jun 5 '15 at 9:43
I read this and as you said it's "clear as mud". Still no sure whether all the metadata fields the OP wants can be stored per page. Plus of course you're absolutely right, finding apps that properly conform to the specs is not easy.
– Karan
Jun 5 '15 at 9:43
Thanks @Karan, I've improved the wording to more explicitly answer the question.
– Julian Knight
Jun 5 '15 at 10:00
Thanks @Karan, I've improved the wording to more explicitly answer the question.
– Julian Knight
Jun 5 '15 at 10:00
+1 Based on the official specs I think this is the best we can get, unless some PDF expert can prove otherwise (that per page metadata of the sort wanted by the OP is indeed possible).
– Karan
Jun 5 '15 at 10:15
+1 Based on the official specs I think this is the best we can get, unless some PDF expert can prove otherwise (that per page metadata of the sort wanted by the OP is indeed possible).
– Karan
Jun 5 '15 at 10:15
@JulianKnight - yes, this might be what I'm looking for. Each page consists of a scanned image. I'm hoping to store some easily searchable reference info along with each image. Thanks. :)
– teracow
Jun 7 '15 at 2:53
@JulianKnight - yes, this might be what I'm looking for. Each page consists of a scanned image. I'm hoping to store some easily searchable reference info along with each image. Thanks. :)
– teracow
Jun 7 '15 at 2:53
add a comment |
up vote
0
down vote
PDF pages can have annotations; the most common type of them might be those that are like sticky notes but these are not the only ones. These are described in part 8.4 of the Adobe PDF 1.7 reference. You can create text annotations, name them with keys like "author" and set the contents to the corresponding string values. Then set the hidden flag to true so the annotation is not displayed or allowed to interact with the user. There's a requirement to set a rectangle for the annotation but since it's not going to be displayed any rectangle inside the page should work.
add a comment |
up vote
0
down vote
PDF pages can have annotations; the most common type of them might be those that are like sticky notes but these are not the only ones. These are described in part 8.4 of the Adobe PDF 1.7 reference. You can create text annotations, name them with keys like "author" and set the contents to the corresponding string values. Then set the hidden flag to true so the annotation is not displayed or allowed to interact with the user. There's a requirement to set a rectangle for the annotation but since it's not going to be displayed any rectangle inside the page should work.
add a comment |
up vote
0
down vote
up vote
0
down vote
PDF pages can have annotations; the most common type of them might be those that are like sticky notes but these are not the only ones. These are described in part 8.4 of the Adobe PDF 1.7 reference. You can create text annotations, name them with keys like "author" and set the contents to the corresponding string values. Then set the hidden flag to true so the annotation is not displayed or allowed to interact with the user. There's a requirement to set a rectangle for the annotation but since it's not going to be displayed any rectangle inside the page should work.
PDF pages can have annotations; the most common type of them might be those that are like sticky notes but these are not the only ones. These are described in part 8.4 of the Adobe PDF 1.7 reference. You can create text annotations, name them with keys like "author" and set the contents to the corresponding string values. Then set the hidden flag to true so the annotation is not displayed or allowed to interact with the user. There's a requirement to set a rectangle for the annotation but since it's not going to be displayed any rectangle inside the page should work.
answered Nov 29 at 2:25
etr
1
1
add a comment |
add a comment |
Thanks for contributing an answer to Super User!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Some of your past answers have not been well-received, and you're in danger of being blocked from answering.
Please pay close attention to the following guidance:
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fsuperuser.com%2fquestions%2f924158%2fcan-each-page-in-a-pdf-contain-its-own-metadata%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
The easiest way to confirm is to go right to the source, i.e. the ISO 32000-1 Standards document.
– Karan
Jun 5 '15 at 9:08
@Karan, that is slightly difficult as the actual standard is very expensive - CHF200. Fortunately there are legitimate downloads.
– Julian Knight
Jun 5 '15 at 9:14
@JulianKnight: Meh, cost never occurred to me because I've always obtained it for free (including supplemental changes) from adobe.com/devnet/pdf/pdf_reference.html
– Karan
Jun 5 '15 at 9:42
@Karan - ah, well it was you who mentioned the ISO standard :) It always annoys me that you have to pay so much to get copies of ISO standards.
– Julian Knight
Jun 5 '15 at 10:02
@JulianKnight: I did, but the very first search result for "ISO 32000-1 Standards" is Adobe's PDF so... Re. the cost I'm completely with you. I see no reason why they need to charge so much, especially from individuals and not companies.
– Karan
Jun 5 '15 at 10:05