Status of USR Doc conversion (was Re: New vgetty release)
"Robert J. Brown" (rj@eli.elilabs.com)
Thu, 2 Apr 1998 21:12:50 +0200
>>>>> "dthumim" == dthumim <dthumim@alum.mit.edu> writes:
dthumim> From: Wes Brown <wes@prozac.eeap.cwru.edu> Date: Thu,
dthumim> 2 Apr 1998 08:11:16 -0500
dthumim> I have the file. I managed to convince M$word to save
dthumim> the file as an RTF document. It has been run through
dthumim> RTFtoHTML. Now the thing is a mess. I am correcting the
dthumim> layout as best HTML will allow, and converting everything
dthumim> to text.
dthumim> You might want to check out demoronizer, a tool for
dthumim> fixing broken Microsoft HTML. I haven't used it myself,
dthumim> so don't ask me for help, and I don't know if the
dthumim> RTFtoHTML filter you mention has the same problems, but
dthumim> you can read about it at:
dthumim> http://www.fourmilab.ch/webtools/demoronizer/
No, the problem he is facing is broken RTF -- Microsoft *CHANGES* the
definition of RTF with every new release of msword. This makes it
impossible for non-M$ programmers to keep utilities up to date. The
best advice would be to load the msword doc intoan *OLDER* version of
word and then save that as the RTF. That would probably generate the
definition of RTF that the rtf2html filter is expecting.
--
-------- "And there came a writing to him from Elijah" [2Ch 21:12] --------
Robert Jay Brown III rj@eli.elilabs.com http://www.elilabs.com 1 847 705-0424
Elijah Laboratories Inc.; 37 South Greenwood Avenue; Palatine, IL 60067-6328
----- M o d e l i n g t h e M e t h o d s o f t h e M i n d ------