Automatic extraction of individual html pages from a Website
Thread poster: Noemi Carrera
Noemi Carrera
Noemi Carrera  Identity Verified
Spain
Local time: 21:10
Member (2003)
English to Spanish
May 9, 2006

Hi everyone,

I need to translate a Website that consists of lots of html pages. The client has not provided us with these pages, just with the .doc files.

I would prefer to work in TagEditor because there is a lot of formatting and tables everywhere and was wondering if there was any software that allows to extract automatically all the individual html pages from a Website.

Thank you very much in advance!

Best regards,

Noemí


 
Robert Tucker (X)
Robert Tucker (X)
United Kingdom
Local time: 20:10
German to English
+ ...
wget May 9, 2006

Originally written for Unix there are now Windows versions. You may want to search the net for a version you like the look of most; I found this one:

http://users.ugent.be/~bpuype/wget/

There is other software for the task, but I still find wget the easiest to use even though it is command line.


 
tlmurray (X)
tlmurray (X)
Local time: 16:10
English
Acrobat, others May 9, 2006

Acrobat (Pro, at least) will dredge through an entire site and make a PDF of each page.

If you're fortunate to have a Mac, Webstractor (softchaos.com) pulls pages into a document that allows editing right there, sort of like viewing a page "in Word". There may be similar tools in Windows.

I noticed you said the client gave you the .doc files. Do you mean that the Web site is made from Word-to-Web, and you have the native docs? Because that sounds like you're home fre
... See more
Acrobat (Pro, at least) will dredge through an entire site and make a PDF of each page.

If you're fortunate to have a Mac, Webstractor (softchaos.com) pulls pages into a document that allows editing right there, sort of like viewing a page "in Word". There may be similar tools in Windows.

I noticed you said the client gave you the .doc files. Do you mean that the Web site is made from Word-to-Web, and you have the native docs? Because that sounds like you're home free for translating...
Collapse


 
Maria Asis
Maria Asis  Identity Verified
Spain
Local time: 21:10
Member (2002)
English to Spanish
+ ...
Try WinHTTrack Website Copier May 9, 2006

Hi!

I'm a great fan of WinHTTrack!

Find it here: http://www.httrack.com/

Luck!

MJ


 
Maria Asis
Maria Asis  Identity Verified
Spain
Local time: 21:10
Member (2002)
English to Spanish
+ ...
May 9, 2006



[Edited at 2006-05-09 21:38]


 


To report site rules violations or get help, contact a site moderator:

Moderator(s) of this forum
Laureana Pavon[Call to this topic]

You can also contact site staff by submitting a support request »

Automatic extraction of individual html pages from a Website






Wordfast Pro
Translation Memory Software for Any Platform

Exclusive discount for ProZ.com users! Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value

Buy now! »
TM-Town
Manage your TMs and Terms ... and boost your translation business

Are you ready for something fresh in the industry? TM-Town is a unique new site for you -- the freelance translator -- to store, manage and share translation memories (TMs) and glossaries...and potentially meet new clients on the basis of your prior work.

More info »