Can each page in a PDF contain its own metadata?











up vote
1
down vote

favorite












Is it possible to have the same field names, but with different metadata values for each page in a PDF?



For example, let's say each page could have the fields 'author', 'document reference', 'location', etc... and these have different values on each page. So this example would have on page 1 the author "John Smith" and page 2 would have the author "Jane Simmons" and so on.



The only examples I've seen for PDF metadata all relate to document-wide information - but none for page-only information.



I'm developing in Python.



Thank You. :)










share|improve this question
























  • The easiest way to confirm is to go right to the source, i.e. the ISO 32000-1 Standards document.
    – Karan
    Jun 5 '15 at 9:08










  • @Karan, that is slightly difficult as the actual standard is very expensive - CHF200. Fortunately there are legitimate downloads.
    – Julian Knight
    Jun 5 '15 at 9:14












  • @JulianKnight: Meh, cost never occurred to me because I've always obtained it for free (including supplemental changes) from adobe.com/devnet/pdf/pdf_reference.html
    – Karan
    Jun 5 '15 at 9:42










  • @Karan - ah, well it was you who mentioned the ISO standard :) It always annoys me that you have to pay so much to get copies of ISO standards.
    – Julian Knight
    Jun 5 '15 at 10:02










  • @JulianKnight: I did, but the very first search result for "ISO 32000-1 Standards" is Adobe's PDF so... Re. the cost I'm completely with you. I see no reason why they need to charge so much, especially from individuals and not companies.
    – Karan
    Jun 5 '15 at 10:05















up vote
1
down vote

favorite












Is it possible to have the same field names, but with different metadata values for each page in a PDF?



For example, let's say each page could have the fields 'author', 'document reference', 'location', etc... and these have different values on each page. So this example would have on page 1 the author "John Smith" and page 2 would have the author "Jane Simmons" and so on.



The only examples I've seen for PDF metadata all relate to document-wide information - but none for page-only information.



I'm developing in Python.



Thank You. :)










share|improve this question
























  • The easiest way to confirm is to go right to the source, i.e. the ISO 32000-1 Standards document.
    – Karan
    Jun 5 '15 at 9:08










  • @Karan, that is slightly difficult as the actual standard is very expensive - CHF200. Fortunately there are legitimate downloads.
    – Julian Knight
    Jun 5 '15 at 9:14












  • @JulianKnight: Meh, cost never occurred to me because I've always obtained it for free (including supplemental changes) from adobe.com/devnet/pdf/pdf_reference.html
    – Karan
    Jun 5 '15 at 9:42










  • @Karan - ah, well it was you who mentioned the ISO standard :) It always annoys me that you have to pay so much to get copies of ISO standards.
    – Julian Knight
    Jun 5 '15 at 10:02










  • @JulianKnight: I did, but the very first search result for "ISO 32000-1 Standards" is Adobe's PDF so... Re. the cost I'm completely with you. I see no reason why they need to charge so much, especially from individuals and not companies.
    – Karan
    Jun 5 '15 at 10:05













up vote
1
down vote

favorite









up vote
1
down vote

favorite











Is it possible to have the same field names, but with different metadata values for each page in a PDF?



For example, let's say each page could have the fields 'author', 'document reference', 'location', etc... and these have different values on each page. So this example would have on page 1 the author "John Smith" and page 2 would have the author "Jane Simmons" and so on.



The only examples I've seen for PDF metadata all relate to document-wide information - but none for page-only information.



I'm developing in Python.



Thank You. :)










share|improve this question















Is it possible to have the same field names, but with different metadata values for each page in a PDF?



For example, let's say each page could have the fields 'author', 'document reference', 'location', etc... and these have different values on each page. So this example would have on page 1 the author "John Smith" and page 2 would have the author "Jane Simmons" and so on.



The only examples I've seen for PDF metadata all relate to document-wide information - but none for page-only information.



I'm developing in Python.



Thank You. :)







pdf metadata






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Jun 6 '15 at 2:00

























asked Jun 5 '15 at 8:56









teracow

85




85












  • The easiest way to confirm is to go right to the source, i.e. the ISO 32000-1 Standards document.
    – Karan
    Jun 5 '15 at 9:08










  • @Karan, that is slightly difficult as the actual standard is very expensive - CHF200. Fortunately there are legitimate downloads.
    – Julian Knight
    Jun 5 '15 at 9:14












  • @JulianKnight: Meh, cost never occurred to me because I've always obtained it for free (including supplemental changes) from adobe.com/devnet/pdf/pdf_reference.html
    – Karan
    Jun 5 '15 at 9:42










  • @Karan - ah, well it was you who mentioned the ISO standard :) It always annoys me that you have to pay so much to get copies of ISO standards.
    – Julian Knight
    Jun 5 '15 at 10:02










  • @JulianKnight: I did, but the very first search result for "ISO 32000-1 Standards" is Adobe's PDF so... Re. the cost I'm completely with you. I see no reason why they need to charge so much, especially from individuals and not companies.
    – Karan
    Jun 5 '15 at 10:05


















  • The easiest way to confirm is to go right to the source, i.e. the ISO 32000-1 Standards document.
    – Karan
    Jun 5 '15 at 9:08










  • @Karan, that is slightly difficult as the actual standard is very expensive - CHF200. Fortunately there are legitimate downloads.
    – Julian Knight
    Jun 5 '15 at 9:14












  • @JulianKnight: Meh, cost never occurred to me because I've always obtained it for free (including supplemental changes) from adobe.com/devnet/pdf/pdf_reference.html
    – Karan
    Jun 5 '15 at 9:42










  • @Karan - ah, well it was you who mentioned the ISO standard :) It always annoys me that you have to pay so much to get copies of ISO standards.
    – Julian Knight
    Jun 5 '15 at 10:02










  • @JulianKnight: I did, but the very first search result for "ISO 32000-1 Standards" is Adobe's PDF so... Re. the cost I'm completely with you. I see no reason why they need to charge so much, especially from individuals and not companies.
    – Karan
    Jun 5 '15 at 10:05
















The easiest way to confirm is to go right to the source, i.e. the ISO 32000-1 Standards document.
– Karan
Jun 5 '15 at 9:08




The easiest way to confirm is to go right to the source, i.e. the ISO 32000-1 Standards document.
– Karan
Jun 5 '15 at 9:08












@Karan, that is slightly difficult as the actual standard is very expensive - CHF200. Fortunately there are legitimate downloads.
– Julian Knight
Jun 5 '15 at 9:14






@Karan, that is slightly difficult as the actual standard is very expensive - CHF200. Fortunately there are legitimate downloads.
– Julian Knight
Jun 5 '15 at 9:14














@JulianKnight: Meh, cost never occurred to me because I've always obtained it for free (including supplemental changes) from adobe.com/devnet/pdf/pdf_reference.html
– Karan
Jun 5 '15 at 9:42




@JulianKnight: Meh, cost never occurred to me because I've always obtained it for free (including supplemental changes) from adobe.com/devnet/pdf/pdf_reference.html
– Karan
Jun 5 '15 at 9:42












@Karan - ah, well it was you who mentioned the ISO standard :) It always annoys me that you have to pay so much to get copies of ISO standards.
– Julian Knight
Jun 5 '15 at 10:02




@Karan - ah, well it was you who mentioned the ISO standard :) It always annoys me that you have to pay so much to get copies of ISO standards.
– Julian Knight
Jun 5 '15 at 10:02












@JulianKnight: I did, but the very first search result for "ISO 32000-1 Standards" is Adobe's PDF so... Re. the cost I'm completely with you. I see no reason why they need to charge so much, especially from individuals and not companies.
– Karan
Jun 5 '15 at 10:05




@JulianKnight: I did, but the very first search result for "ISO 32000-1 Standards" is Adobe's PDF so... Re. the cost I'm completely with you. I see no reason why they need to charge so much, especially from individuals and not companies.
– Karan
Jun 5 '15 at 10:05










2 Answers
2






active

oldest

votes

















up vote
1
down vote



accepted










It looks as though the standard supports metadata at more than just the document level:




In general, any PDF stream or dictionary may have metadata attached to
it as long as the stream or dictionary represents an actual
information resource, as opposed to serving as an implementation
artifact. Some PDF constructs are considered implementational, and
hence may not have associated metadata.




Clear as mud! Thankfully there are some additional notes. Including:




In addition, metadata may also be associated with marked content
within a content stream. This association shall be created by
including an entry in the property list dictionary whose key shall be
Metadata and whose value shall be the metadata stream dictionary.
Because this construct refers to an object outside the content
stream, the property list is referred to indirectly as a named
resource (see 14.6.2, “Property Lists”).




This means that you can attach metadata to certain artifacts within your document but I don't believe that you can attach them to a specific page, you would have to have an object that you attached the data to - an image would be the obvious example though the standard seems to refer to shadings too.



Of course, although the standard seems to allow it, that doesn't mean that common PDF handling libraries and applications support it.



Adobe's downloadable version of the Standard (will save you CHF200)






share|improve this answer























  • I read this and as you said it's "clear as mud". Still no sure whether all the metadata fields the OP wants can be stored per page. Plus of course you're absolutely right, finding apps that properly conform to the specs is not easy.
    – Karan
    Jun 5 '15 at 9:43












  • Thanks @Karan, I've improved the wording to more explicitly answer the question.
    – Julian Knight
    Jun 5 '15 at 10:00










  • +1 Based on the official specs I think this is the best we can get, unless some PDF expert can prove otherwise (that per page metadata of the sort wanted by the OP is indeed possible).
    – Karan
    Jun 5 '15 at 10:15












  • @JulianKnight - yes, this might be what I'm looking for. Each page consists of a scanned image. I'm hoping to store some easily searchable reference info along with each image. Thanks. :)
    – teracow
    Jun 7 '15 at 2:53


















up vote
0
down vote













PDF pages can have annotations; the most common type of them might be those that are like sticky notes but these are not the only ones. These are described in part 8.4 of the Adobe PDF 1.7 reference. You can create text annotations, name them with keys like "author" and set the contents to the corresponding string values. Then set the hidden flag to true so the annotation is not displayed or allowed to interact with the user. There's a requirement to set a rectangle for the annotation but since it's not going to be displayed any rectangle inside the page should work.






share|improve this answer





















    Your Answer








    StackExchange.ready(function() {
    var channelOptions = {
    tags: "".split(" "),
    id: "3"
    };
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function() {
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled) {
    StackExchange.using("snippets", function() {
    createEditor();
    });
    }
    else {
    createEditor();
    }
    });

    function createEditor() {
    StackExchange.prepareEditor({
    heartbeatType: 'answer',
    convertImagesToLinks: true,
    noModals: true,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: 10,
    bindNavPrevention: true,
    postfix: "",
    imageUploader: {
    brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
    contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
    allowUrls: true
    },
    onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    });


    }
    });














    draft saved

    draft discarded


















    StackExchange.ready(
    function () {
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fsuperuser.com%2fquestions%2f924158%2fcan-each-page-in-a-pdf-contain-its-own-metadata%23new-answer', 'question_page');
    }
    );

    Post as a guest















    Required, but never shown

























    2 Answers
    2






    active

    oldest

    votes








    2 Answers
    2






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes








    up vote
    1
    down vote



    accepted










    It looks as though the standard supports metadata at more than just the document level:




    In general, any PDF stream or dictionary may have metadata attached to
    it as long as the stream or dictionary represents an actual
    information resource, as opposed to serving as an implementation
    artifact. Some PDF constructs are considered implementational, and
    hence may not have associated metadata.




    Clear as mud! Thankfully there are some additional notes. Including:




    In addition, metadata may also be associated with marked content
    within a content stream. This association shall be created by
    including an entry in the property list dictionary whose key shall be
    Metadata and whose value shall be the metadata stream dictionary.
    Because this construct refers to an object outside the content
    stream, the property list is referred to indirectly as a named
    resource (see 14.6.2, “Property Lists”).




    This means that you can attach metadata to certain artifacts within your document but I don't believe that you can attach them to a specific page, you would have to have an object that you attached the data to - an image would be the obvious example though the standard seems to refer to shadings too.



    Of course, although the standard seems to allow it, that doesn't mean that common PDF handling libraries and applications support it.



    Adobe's downloadable version of the Standard (will save you CHF200)






    share|improve this answer























    • I read this and as you said it's "clear as mud". Still no sure whether all the metadata fields the OP wants can be stored per page. Plus of course you're absolutely right, finding apps that properly conform to the specs is not easy.
      – Karan
      Jun 5 '15 at 9:43












    • Thanks @Karan, I've improved the wording to more explicitly answer the question.
      – Julian Knight
      Jun 5 '15 at 10:00










    • +1 Based on the official specs I think this is the best we can get, unless some PDF expert can prove otherwise (that per page metadata of the sort wanted by the OP is indeed possible).
      – Karan
      Jun 5 '15 at 10:15












    • @JulianKnight - yes, this might be what I'm looking for. Each page consists of a scanned image. I'm hoping to store some easily searchable reference info along with each image. Thanks. :)
      – teracow
      Jun 7 '15 at 2:53















    up vote
    1
    down vote



    accepted










    It looks as though the standard supports metadata at more than just the document level:




    In general, any PDF stream or dictionary may have metadata attached to
    it as long as the stream or dictionary represents an actual
    information resource, as opposed to serving as an implementation
    artifact. Some PDF constructs are considered implementational, and
    hence may not have associated metadata.




    Clear as mud! Thankfully there are some additional notes. Including:




    In addition, metadata may also be associated with marked content
    within a content stream. This association shall be created by
    including an entry in the property list dictionary whose key shall be
    Metadata and whose value shall be the metadata stream dictionary.
    Because this construct refers to an object outside the content
    stream, the property list is referred to indirectly as a named
    resource (see 14.6.2, “Property Lists”).




    This means that you can attach metadata to certain artifacts within your document but I don't believe that you can attach them to a specific page, you would have to have an object that you attached the data to - an image would be the obvious example though the standard seems to refer to shadings too.



    Of course, although the standard seems to allow it, that doesn't mean that common PDF handling libraries and applications support it.



    Adobe's downloadable version of the Standard (will save you CHF200)






    share|improve this answer























    • I read this and as you said it's "clear as mud". Still no sure whether all the metadata fields the OP wants can be stored per page. Plus of course you're absolutely right, finding apps that properly conform to the specs is not easy.
      – Karan
      Jun 5 '15 at 9:43












    • Thanks @Karan, I've improved the wording to more explicitly answer the question.
      – Julian Knight
      Jun 5 '15 at 10:00










    • +1 Based on the official specs I think this is the best we can get, unless some PDF expert can prove otherwise (that per page metadata of the sort wanted by the OP is indeed possible).
      – Karan
      Jun 5 '15 at 10:15












    • @JulianKnight - yes, this might be what I'm looking for. Each page consists of a scanned image. I'm hoping to store some easily searchable reference info along with each image. Thanks. :)
      – teracow
      Jun 7 '15 at 2:53













    up vote
    1
    down vote



    accepted







    up vote
    1
    down vote



    accepted






    It looks as though the standard supports metadata at more than just the document level:




    In general, any PDF stream or dictionary may have metadata attached to
    it as long as the stream or dictionary represents an actual
    information resource, as opposed to serving as an implementation
    artifact. Some PDF constructs are considered implementational, and
    hence may not have associated metadata.




    Clear as mud! Thankfully there are some additional notes. Including:




    In addition, metadata may also be associated with marked content
    within a content stream. This association shall be created by
    including an entry in the property list dictionary whose key shall be
    Metadata and whose value shall be the metadata stream dictionary.
    Because this construct refers to an object outside the content
    stream, the property list is referred to indirectly as a named
    resource (see 14.6.2, “Property Lists”).




    This means that you can attach metadata to certain artifacts within your document but I don't believe that you can attach them to a specific page, you would have to have an object that you attached the data to - an image would be the obvious example though the standard seems to refer to shadings too.



    Of course, although the standard seems to allow it, that doesn't mean that common PDF handling libraries and applications support it.



    Adobe's downloadable version of the Standard (will save you CHF200)






    share|improve this answer














    It looks as though the standard supports metadata at more than just the document level:




    In general, any PDF stream or dictionary may have metadata attached to
    it as long as the stream or dictionary represents an actual
    information resource, as opposed to serving as an implementation
    artifact. Some PDF constructs are considered implementational, and
    hence may not have associated metadata.




    Clear as mud! Thankfully there are some additional notes. Including:




    In addition, metadata may also be associated with marked content
    within a content stream. This association shall be created by
    including an entry in the property list dictionary whose key shall be
    Metadata and whose value shall be the metadata stream dictionary.
    Because this construct refers to an object outside the content
    stream, the property list is referred to indirectly as a named
    resource (see 14.6.2, “Property Lists”).




    This means that you can attach metadata to certain artifacts within your document but I don't believe that you can attach them to a specific page, you would have to have an object that you attached the data to - an image would be the obvious example though the standard seems to refer to shadings too.



    Of course, although the standard seems to allow it, that doesn't mean that common PDF handling libraries and applications support it.



    Adobe's downloadable version of the Standard (will save you CHF200)







    share|improve this answer














    share|improve this answer



    share|improve this answer








    edited Jun 5 '15 at 9:59

























    answered Jun 5 '15 at 9:23









    Julian Knight

    12.9k11535




    12.9k11535












    • I read this and as you said it's "clear as mud". Still no sure whether all the metadata fields the OP wants can be stored per page. Plus of course you're absolutely right, finding apps that properly conform to the specs is not easy.
      – Karan
      Jun 5 '15 at 9:43












    • Thanks @Karan, I've improved the wording to more explicitly answer the question.
      – Julian Knight
      Jun 5 '15 at 10:00










    • +1 Based on the official specs I think this is the best we can get, unless some PDF expert can prove otherwise (that per page metadata of the sort wanted by the OP is indeed possible).
      – Karan
      Jun 5 '15 at 10:15












    • @JulianKnight - yes, this might be what I'm looking for. Each page consists of a scanned image. I'm hoping to store some easily searchable reference info along with each image. Thanks. :)
      – teracow
      Jun 7 '15 at 2:53


















    • I read this and as you said it's "clear as mud". Still no sure whether all the metadata fields the OP wants can be stored per page. Plus of course you're absolutely right, finding apps that properly conform to the specs is not easy.
      – Karan
      Jun 5 '15 at 9:43












    • Thanks @Karan, I've improved the wording to more explicitly answer the question.
      – Julian Knight
      Jun 5 '15 at 10:00










    • +1 Based on the official specs I think this is the best we can get, unless some PDF expert can prove otherwise (that per page metadata of the sort wanted by the OP is indeed possible).
      – Karan
      Jun 5 '15 at 10:15












    • @JulianKnight - yes, this might be what I'm looking for. Each page consists of a scanned image. I'm hoping to store some easily searchable reference info along with each image. Thanks. :)
      – teracow
      Jun 7 '15 at 2:53
















    I read this and as you said it's "clear as mud". Still no sure whether all the metadata fields the OP wants can be stored per page. Plus of course you're absolutely right, finding apps that properly conform to the specs is not easy.
    – Karan
    Jun 5 '15 at 9:43






    I read this and as you said it's "clear as mud". Still no sure whether all the metadata fields the OP wants can be stored per page. Plus of course you're absolutely right, finding apps that properly conform to the specs is not easy.
    – Karan
    Jun 5 '15 at 9:43














    Thanks @Karan, I've improved the wording to more explicitly answer the question.
    – Julian Knight
    Jun 5 '15 at 10:00




    Thanks @Karan, I've improved the wording to more explicitly answer the question.
    – Julian Knight
    Jun 5 '15 at 10:00












    +1 Based on the official specs I think this is the best we can get, unless some PDF expert can prove otherwise (that per page metadata of the sort wanted by the OP is indeed possible).
    – Karan
    Jun 5 '15 at 10:15






    +1 Based on the official specs I think this is the best we can get, unless some PDF expert can prove otherwise (that per page metadata of the sort wanted by the OP is indeed possible).
    – Karan
    Jun 5 '15 at 10:15














    @JulianKnight - yes, this might be what I'm looking for. Each page consists of a scanned image. I'm hoping to store some easily searchable reference info along with each image. Thanks. :)
    – teracow
    Jun 7 '15 at 2:53




    @JulianKnight - yes, this might be what I'm looking for. Each page consists of a scanned image. I'm hoping to store some easily searchable reference info along with each image. Thanks. :)
    – teracow
    Jun 7 '15 at 2:53












    up vote
    0
    down vote













    PDF pages can have annotations; the most common type of them might be those that are like sticky notes but these are not the only ones. These are described in part 8.4 of the Adobe PDF 1.7 reference. You can create text annotations, name them with keys like "author" and set the contents to the corresponding string values. Then set the hidden flag to true so the annotation is not displayed or allowed to interact with the user. There's a requirement to set a rectangle for the annotation but since it's not going to be displayed any rectangle inside the page should work.






    share|improve this answer

























      up vote
      0
      down vote













      PDF pages can have annotations; the most common type of them might be those that are like sticky notes but these are not the only ones. These are described in part 8.4 of the Adobe PDF 1.7 reference. You can create text annotations, name them with keys like "author" and set the contents to the corresponding string values. Then set the hidden flag to true so the annotation is not displayed or allowed to interact with the user. There's a requirement to set a rectangle for the annotation but since it's not going to be displayed any rectangle inside the page should work.






      share|improve this answer























        up vote
        0
        down vote










        up vote
        0
        down vote









        PDF pages can have annotations; the most common type of them might be those that are like sticky notes but these are not the only ones. These are described in part 8.4 of the Adobe PDF 1.7 reference. You can create text annotations, name them with keys like "author" and set the contents to the corresponding string values. Then set the hidden flag to true so the annotation is not displayed or allowed to interact with the user. There's a requirement to set a rectangle for the annotation but since it's not going to be displayed any rectangle inside the page should work.






        share|improve this answer












        PDF pages can have annotations; the most common type of them might be those that are like sticky notes but these are not the only ones. These are described in part 8.4 of the Adobe PDF 1.7 reference. You can create text annotations, name them with keys like "author" and set the contents to the corresponding string values. Then set the hidden flag to true so the annotation is not displayed or allowed to interact with the user. There's a requirement to set a rectangle for the annotation but since it's not going to be displayed any rectangle inside the page should work.







        share|improve this answer












        share|improve this answer



        share|improve this answer










        answered Nov 29 at 2:25









        etr

        1




        1






























            draft saved

            draft discarded




















































            Thanks for contributing an answer to Super User!


            • Please be sure to answer the question. Provide details and share your research!

            But avoid



            • Asking for help, clarification, or responding to other answers.

            • Making statements based on opinion; back them up with references or personal experience.


            To learn more, see our tips on writing great answers.





            Some of your past answers have not been well-received, and you're in danger of being blocked from answering.


            Please pay close attention to the following guidance:


            • Please be sure to answer the question. Provide details and share your research!

            But avoid



            • Asking for help, clarification, or responding to other answers.

            • Making statements based on opinion; back them up with references or personal experience.


            To learn more, see our tips on writing great answers.




            draft saved


            draft discarded














            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fsuperuser.com%2fquestions%2f924158%2fcan-each-page-in-a-pdf-contain-its-own-metadata%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown





















































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown

































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown







            Popular posts from this blog

            "Incorrect syntax near the keyword 'ON'. (on update cascade, on delete cascade,)

            Alcedinidae

            Origin of the phrase “under your belt”?