How can I efficiently detect all blank pages in a PDF?












-1















My current solution is to open "the Page Thumbnails pane on the left" and spot the blank pages by eye. Has a more efficient solution emerged since this Jul 2015 post?

I'm not asking about deleting the blank pages automatically; I must check for myself that the pages are blank before deleting them.



































  • You could speed up a manual process by displaying a grid of page thumbnails, say a 5 high x 10 wide grid, so you could see 50 pages at a time. It would take just seconds to screen them, and delete blanks.

    – fixer1234
    May 31 '18 at 0:58











  • @fixer1234 Thanks. I updated my post to clarify.

    – Greek - Area 51 Proposal
    Aug 24 '18 at 2:00
















pdf














edited Jan 15 at 6:50







Greek - Area 51 Proposal

















asked May 30 '18 at 2:29









Greek - Area 51 Proposal























1 Answer


















2














To start with, it is quite difficult to determine whether a page actually is blank. For example, there might be objects visible outside the PDF box of your choice (such as trim marks), or object layers that can be switched on or off conditionally.



For a programmatic solution, you could either




  • use a library like PoDoFo to load the PDF into memory, analyze each page's content stream, and look for any painting operators or embedded XObjects;


  • or (on macOS) use CGContextDrawPDFPage (Core Graphics) to render each page (a low resolution like 72 dpi should do) and scan the resulting image for non-white pixels.
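
As a rough illustration of both checks, here is a minimal Python sketch. It is not PoDoFo or Core Graphics code. It assumes some PDF library has already handed you either the decoded (uncompressed) content stream of a page as bytes, or an 8-bit grayscale rendering of the page as bytes. The function names, the white level of 250, and the 0.999 ratio are hypothetical choices for illustration. The stream check is deliberately conservative: any painting operator counts as "not blank", even if it paints white on white or invokes an empty XObject.

```python
import re

# Operators that put marks on the page: path painting, text showing,
# XObject invocation, shading, and the start of an inline image.
PAINT_OPS = {b"S", b"s", b"f", b"F", b"f*", b"B", b"B*", b"b", b"b*",
             b"Tj", b"TJ", b"'", b'"', b"Do", b"sh", b"BI"}

# One alternative per token kind. Literal strings are handled separately
# because their parentheses may nest.
TOKEN = re.compile(rb"""
    %[^\r\n]*            # comment
  | <<|>>                # dictionary delimiters
  | <[0-9A-Fa-f\s]*>     # hex string
  | [\[\]{}]             # array and procedure delimiters
  | /[^\s()<>\[\]{}/%]*  # name
  | [^\s()<>\[\]{}/%]+   # number or operator
""", re.VERBOSE)


def skip_literal_string(data: bytes, pos: int) -> int:
    """Return the index just past the literal string that opens at data[pos]."""
    depth = 0
    while pos < len(data):
        c = data[pos:pos + 1]
        if c == b"\\":
            pos += 2          # skip the backslash and the escaped byte
            continue
        if c == b"(":
            depth += 1
        elif c == b")":
            depth -= 1
            if depth == 0:
                return pos + 1
        pos += 1
    return pos                # unterminated string: stop at end of data


def stream_paints(content: bytes) -> bool:
    """True if a decoded content stream contains any painting operator."""
    pos = 0
    while pos < len(content):
        if content[pos:pos + 1] == b"(":
            pos = skip_literal_string(content, pos)
            continue
        m = TOKEN.match(content, pos)
        if m is None:
            pos += 1          # whitespace or a stray delimiter
            continue
        if m.group() in PAINT_OPS:
            return True
        pos = m.end()
    return False


def raster_is_blank(gray: bytes, white_level: int = 250,
                    min_ratio: float = 0.999) -> bool:
    """True if nearly all pixels of an 8-bit grayscale raster are near-white."""
    if not gray:
        return True
    white = sum(1 for v in gray if v >= white_level)
    return white / len(gray) >= min_ratio
```

Either function only nominates candidate pages. Given the caveats about page boxes and layers, the candidates still need to be checked by eye before anything is deleted, which is what the question asks for anyway.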







        answered May 30 '18 at 11:49









        jvb





























