PDF conversion Autor de la hebra: Louise Mawbey
|
Louise Mawbey Alemania Local time: 01:55 Miembro 2006 alemán al inglés
There are a few threads on this subject in the various forums but they are quite old. Maybe there are some better options now.
What is the best tool for converting PDFs into Word so that I can translate using Studio? Some of the PDFs I have to translate are scans of certificates etc. that contain tricky formatting, such as columns, tables etc.
I've tried using the option in Word itself and the option in Studio but there are so many formatting issues that I really need s... See more There are a few threads on this subject in the various forums but they are quite old. Maybe there are some better options now.
What is the best tool for converting PDFs into Word so that I can translate using Studio? Some of the PDFs I have to translate are scans of certificates etc. that contain tricky formatting, such as columns, tables etc.
I've tried using the option in Word itself and the option in Studio but there are so many formatting issues that I really need something better.
Any tips would be gratefully received.
[Edited at 2022-05-18 07:01 GMT] ▲ Collapse | | |
Samuel Murray Países Bajos Local time: 01:55 Miembro 2006 inglés al afrikaans + ... Studio itself, or manually | May 17, 2022 |
Louise Mawbey wrote:
What is the best tool for converting PDFs into Word so that I can translate using Studio?
In my experience, Studio's own conversion is better than that of any OCR program I've tried.
Some of the PDFs I have to translate are scans of certificates etc. that contain tricky formatting, such as columns, tables etc.
There comes a point at which the PDF is so unconvertable that you just have to recreate it manually, in Word. When I translate diplomas etc., I take a screenshot of the file, add it as a watermark in Word, then retype the source text and position it over the watermark, and then remove the watermark. | | |
neilmac España Local time: 01:55 español al inglés + ...
I use Nitro Pro, which works for most PDFs, but not the worst, terribly clunky and incompatible kind.
And I don't know about Studio, which is anathema to me. | | |
Andriy Yasharov Ucrania Local time: 02:55 Miembro 2008 inglés al ruso + ...
|
|
Stepan Konev Federación Rusa Local time: 03:55 inglés al ruso Solid Documents Technology | May 17, 2022 |
Studio uses Solid Converter blindly. It means that you can ocr a document with Solid Converter and then import the output as is into Studio. The effect will be the same. A better option could be using a stand-alone OCR app, then tidy up your document manually (or build it from scratch) and only then import it into Studio. This is what they recommended at rws community for better OCR output. | | |
Jorge Payan Colombia Local time: 19:55 Miembro 2002 alemán al español + ... My work flow for scanned PDFs | May 17, 2022 |
ABBYY Finereader -> Transtools -> Studio | | |
John Fossey Canadá Local time: 19:55 Miembro 2008 francés al inglés + ... ABBYY Finereader | May 17, 2022 |
It's quite expensive, but I use ABBYY Finereader, which can make outstanding conversions of most PDFs to Word. Its system of manual zoning of text, table and image areas, as well as the ability to place text over an image makes it very versatile. | | |
expressisverbis Portugal Local time: 00:55 Miembro 2015 inglés al portugués + ...
|
|
Louise Mawbey Alemania Local time: 01:55 Miembro 2006 alemán al inglés PERSONA QUE INICIÓ LA HEBRA
Thanks for all the input. I'll try those solutions out and report back | | |
Foxit PhantomPDF is the best | May 19, 2022 |
Very careful in creating Word docs, in my experience. Much better results than with many other brands. | | |
Abby vs Online OCR | May 22, 2022 |
Abbyy Finereader is very good for isolating various parts of documents, but it tends to get complex tables and combinations of texts and images wrong (a mix of tables and overlapping boxes, specially too many independent boxes spread all over the place).
Online OCR has been giving me the best results overall, plus it's free. I haven't read their Terms of Service and Privacy Policy, but I would be very careful when submitting sensitive documents.
[Edited at 2022-05-22 00:14 GMT] | | |
expressisverbis Portugal Local time: 00:55 Miembro 2015 inglés al portugués + ...
Mario Cerutti wrote:
I haven't read their Terms of Service and Privacy Policy, but I would be very careful when submitting sensitive documents.
[Edited at 2022-05-22 00:14 GMT]
"Secure conversion
All documents uploaded under the free "Guest" account will be deleted automatically after conversion. Output files for registered users are stored one month"
https://www.onlineocr.net/
Privacy Policy
We will not view the files that you upload using the OnlineOCR.net service. We may view your file`s information (file extensions, sizes etc. but not your file contents) to provide technical support.
https://www.onlineocr.net/service/privacypolicy
In the past, I used it rarely, as a guest, and I wasn't registered with OnlineOCR.net.
And, yes, I am very careful. The software I use is Abbyy, and I know Foxit and PDFElement deliver also good results. | | |