Trying to extract text out of PDF document using TET 2.3. Many of the PDF's result in having sentences without spaces between words. It seems to happen when...
... This may happen in some cases where the either the font metrics are strange or the word spacing is very tight. ... The artificial "font_#" names are...
Using: php5, PDFlib-Lite-7.0.4p1 Note: Whoever can post the answer that works first, I'll send you a $10 Amazon gift card... I just need an answer fast....
we have a whole lot of PDF generated from hardcopy paper records. These records are scanned through high end scanners and have OCR engines run on them. The...
Hi All, Is it possible using pdf_fit_textflow to return the coordinates of a specific character? For example can I find out the coordinates of the first ascii...
John, ... you might mark the character with a matchbox. Afterwards you can retrieve the position of the matchbox with PDF_info_matchbox(). We demonstrate this...
Hey Folks. I have the following situation with PDFlib 7.0.3p5: I have a text with soft hyphens that I want to put into a textflow. But the text shall not be...
We are creating a pdf using a TIFF graphic (byte-order Macintosh, no ICC profile 8bit/channel, CMYK, 1200dpi) that had only cyan elements. When the PDF that is...
Look at Xerox Job Ticketing options. Some years ago I used them on big xerox printers. I don't remember if RIP was Creo or DocuSP... perhaps both... The...
Fabrizio Accatino
fhtino@...
Jun 11, 2009 8:43 am
19647
... With an ICC profile attached to the image this would be expected behavior since a device- or process-specific mixture of CMYK colors is required to achieve...
... When you save the file with Acrobat it will generate the required appearance streams. If you generate the PDF with PDFlib you should avoid form fields...
Hello, I searched online and in this group, but did not find an answer, so I am asking. We are currently using PDFLib 7, and are running it on Sun Solaris 5.10...
Fabrizio, %XRX>blah blah blah< are Xerox header commands. You can add one, and only one, to the beginning of a .PDF with, in most cases, little damage to the...
Douglas Hanna
douglas.hanna@...
Jun 11, 2009 3:56 pm
19651
Doug, I worked with %XRX 4 years ago... I don't remember exactly every details but I'm sure: I inserted a lot of them at the beginning of the pdf. My problem...
Fabrizio Accatino
fhtino@...
Jun 12, 2009 6:43 am
19652
Hello. I am struggling with a (to me) very wired problem. In one pdf document, I use the font Futura Book (open type), loaded with encoding set to auto....
... This looks like the "periodcentered" character instead of the "bullet" character. ... This is not the proper UTF-8 sequence for a bullet character, which...
... The text comes from a database (hex value for this particular character: E2 80 A2 according to the hex viewer in my SQL application). So, yes, I am...
... I just realized that you work with "auto" encoding, which translates to iso8859-1 on Mac OS X. However, the "bullet" character sits on U+2022, which is not...
... Hmm... any example in the documentation maybe? I've set the encoding to 'unicode' when loading the font and replaced the "bullet" character with ...
we've got at least 10 xrx lines at the top of our files, and they don't
break in acrobat or reader when we open those files. I wonder what the
"cutoff" is...
so, we did a quick test, and it looks like for us, 19 lines of xrx is
the max. on 20 the pdf would no longer open
________________________________
From:...
All,
What are the hip kids using nowadays for automated / programmatic pdf
preflighting / inspection? I've used pdfspy in the past and just want to
know if...
I don't remember exactly... it should be 1024 bytes... but I don't remember if it's an Acrobat limit or a value from the Pdf Standard Reference. In the past,...
Fabrizio Accatino
fhtino@...
Jun 13, 2009 7:50 am
19661
Hi, please contact our support (support@...) and attach the font to the e-mail. Kurt ... -- ...
Kurt Stützer
kurt@...
Jun 15, 2009 11:33 am
19662
Hi Andreas, I don't understand what you expect exactly. As far as I can see you don't expect an hyphenation after "Blind-" but after "paradies-". What is the...
Kurt Stützer
kurt@...
Jun 15, 2009 1:00 pm
19663
Hi Kurt. Thanks for the answer. The criterion for hyphenating 'paradisematisches' but not 'Blindtexte' would be, that the first one does not fit into the line,...
Hi friends, I've developed web application using pdflib in windows environment. It is working fine in Websphere application server. Now I need to deploy my web...
Hi Andreas, Thanks for the answer. What I mean is the difference (in the second fitbox of which I assume that shows the desired text formatting) between wohnen...