Status of USR Doc conversion (was Re: New vgetty release)

"Robert J. Brown" (rj@eli.elilabs.com)
Thu, 2 Apr 1998 21:12:50 +0200


>>>>> "dthumim" == dthumim  <dthumim@alum.mit.edu> writes:

    dthumim>    From: Wes Brown <wes@prozac.eeap.cwru.edu> Date: Thu,
    dthumim> 2 Apr 1998 08:11:16 -0500

    dthumim>    I have the file.  I managed to convince M$word to save
    dthumim> the file as an RTF document.  It has been run through
    dthumim> RTFtoHTML.  Now the thing is a mess.  I am correcting the
    dthumim> layout as best HTML will allow, and converting everything
    dthumim> to text.

    dthumim> You might want to check out demoronizer, a tool for
    dthumim> fixing broken Microsoft HTML.  I haven't used it myself,
    dthumim> so don't ask me for help, and I don't know if the
    dthumim> RTFtoHTML filter you mention has the same problems, but
    dthumim> you can read about it at:

    dthumim> http://www.fourmilab.ch/webtools/demoronizer/

No, the problem he is facing is broken RTF -- Microsoft *CHANGES* the
definition of RTF with every new release of msword.  This makes it
impossible for non-M$ programmers to keep utilities up to date.  The
best advice would be to load the msword doc intoan *OLDER* version of
word and then save that as the RTF.  That would probably generate the
definition of RTF that the rtf2html filter is expecting.

-- 
--------  "And there came a writing to him from Elijah"  [2Ch 21:12]  --------
Robert Jay Brown III rj@eli.elilabs.com  http://www.elilabs.com 1 847 705-0424
Elijah Laboratories Inc.;  37 South Greenwood Avenue;  Palatine, IL 60067-6328
-----  M o d e l i n g   t h e   M e t h o d s   o f   t h e   M i n d  ------