How can I efficiently detect all blank pages in a PDF?
My current solution is opening "the Page Thumbnails pane on the left", and spotting the blank pages. What's a more efficient solution, since this Jul 2015 post?
I'm not asking about deleting the blank pages automatically; I must check myself if the pages are blank before deleting.
add a comment |
My current solution is opening "the Page Thumbnails pane on the left", and spotting the blank pages. What's a more efficient solution, since this Jul 2015 post?
I'm not asking about deleting the blank pages automatically; I must check myself if the pages are blank before deleting.
You could speed up a manual process by displaying a grid of page thumbnails, say a 5 high x 10 wide grid, so you could see 50 pages at a time. It would take just seconds to screen them, and delete blanks.
– fixer1234
May 31 '18 at 0:58
@fixer1234 Thanks. I updated my post to clarify.
– Greek - Area 51 Proposal
Aug 24 '18 at 2:00
add a comment |
My current solution is opening "the Page Thumbnails pane on the left", and spotting the blank pages. What's a more efficient solution, since this Jul 2015 post?
I'm not asking about deleting the blank pages automatically; I must check myself if the pages are blank before deleting.
My current solution is opening "the Page Thumbnails pane on the left", and spotting the blank pages. What's a more efficient solution, since this Jul 2015 post?
I'm not asking about deleting the blank pages automatically; I must check myself if the pages are blank before deleting.
edited Jan 15 at 6:50
Greek - Area 51 Proposal
asked May 30 '18 at 2:29
Greek - Area 51 ProposalGreek - Area 51 Proposal
44832752
44832752
You could speed up a manual process by displaying a grid of page thumbnails, say a 5 high x 10 wide grid, so you could see 50 pages at a time. It would take just seconds to screen them, and delete blanks.
– fixer1234
May 31 '18 at 0:58
@fixer1234 Thanks. I updated my post to clarify.
– Greek - Area 51 Proposal
Aug 24 '18 at 2:00
add a comment |
You could speed up a manual process by displaying a grid of page thumbnails, say a 5 high x 10 wide grid, so you could see 50 pages at a time. It would take just seconds to screen them, and delete blanks.
– fixer1234
May 31 '18 at 0:58
@fixer1234 Thanks. I updated my post to clarify.
– Greek - Area 51 Proposal
Aug 24 '18 at 2:00
You could speed up a manual process by displaying a grid of page thumbnails, say a 5 high x 10 wide grid, so you could see 50 pages at a time. It would take just seconds to screen them, and delete blanks.
– fixer1234
May 31 '18 at 0:58
You could speed up a manual process by displaying a grid of page thumbnails, say a 5 high x 10 wide grid, so you could see 50 pages at a time. It would take just seconds to screen them, and delete blanks.
– fixer1234
May 31 '18 at 0:58
@fixer1234 Thanks. I updated my post to clarify.
– Greek - Area 51 Proposal
Aug 24 '18 at 2:00
@fixer1234 Thanks. I updated my post to clarify.
– Greek - Area 51 Proposal
Aug 24 '18 at 2:00
add a comment |
1 Answer
1
active
oldest
votes
As a start, it it quite difficult to determine if a page actually is blank. For example, there might be objects visible outside of the PDF box of your choice (like trimming marks), or object layers which can be switched on/off conditionally.
For a programmatic solution, you could either
use a library like
PoDoFoto load the PDF in question into memory, analyze each page object stream, and look for any painting commands/embedded XObjects.or (on MacOS) use
CGContextDrawPDFPage(Core Graphics) to render each page - a low resolution like 72dpi should do -, and to scan for non-white pixels in the resulting image.
add a comment |
Your Answer
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "3"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fsuperuser.com%2fquestions%2f1327038%2fhow-can-i-efficiently-detect-all-blank-pages-in-a-pdf%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
As a start, it it quite difficult to determine if a page actually is blank. For example, there might be objects visible outside of the PDF box of your choice (like trimming marks), or object layers which can be switched on/off conditionally.
For a programmatic solution, you could either
use a library like
PoDoFoto load the PDF in question into memory, analyze each page object stream, and look for any painting commands/embedded XObjects.or (on MacOS) use
CGContextDrawPDFPage(Core Graphics) to render each page - a low resolution like 72dpi should do -, and to scan for non-white pixels in the resulting image.
add a comment |
As a start, it it quite difficult to determine if a page actually is blank. For example, there might be objects visible outside of the PDF box of your choice (like trimming marks), or object layers which can be switched on/off conditionally.
For a programmatic solution, you could either
use a library like
PoDoFoto load the PDF in question into memory, analyze each page object stream, and look for any painting commands/embedded XObjects.or (on MacOS) use
CGContextDrawPDFPage(Core Graphics) to render each page - a low resolution like 72dpi should do -, and to scan for non-white pixels in the resulting image.
add a comment |
As a start, it it quite difficult to determine if a page actually is blank. For example, there might be objects visible outside of the PDF box of your choice (like trimming marks), or object layers which can be switched on/off conditionally.
For a programmatic solution, you could either
use a library like
PoDoFoto load the PDF in question into memory, analyze each page object stream, and look for any painting commands/embedded XObjects.or (on MacOS) use
CGContextDrawPDFPage(Core Graphics) to render each page - a low resolution like 72dpi should do -, and to scan for non-white pixels in the resulting image.
As a start, it it quite difficult to determine if a page actually is blank. For example, there might be objects visible outside of the PDF box of your choice (like trimming marks), or object layers which can be switched on/off conditionally.
For a programmatic solution, you could either
use a library like
PoDoFoto load the PDF in question into memory, analyze each page object stream, and look for any painting commands/embedded XObjects.or (on MacOS) use
CGContextDrawPDFPage(Core Graphics) to render each page - a low resolution like 72dpi should do -, and to scan for non-white pixels in the resulting image.
answered May 30 '18 at 11:49
jvbjvb
1,150413
1,150413
add a comment |
add a comment |
Thanks for contributing an answer to Super User!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fsuperuser.com%2fquestions%2f1327038%2fhow-can-i-efficiently-detect-all-blank-pages-in-a-pdf%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
You could speed up a manual process by displaying a grid of page thumbnails, say a 5 high x 10 wide grid, so you could see 50 pages at a time. It would take just seconds to screen them, and delete blanks.
– fixer1234
May 31 '18 at 0:58
@fixer1234 Thanks. I updated my post to clarify.
– Greek - Area 51 Proposal
Aug 24 '18 at 2:00