catdoc
Details
| Size: | 0K |
| Last Update: | 2008-05-10 02:10:34 |
| Version: | 0.94 |
| OS Support: | Linux |
| License/Program Type: | GPL (GNU General Public License) |
| Publisher: | Victor Wagner |
| Price: | $0.00 |
Description:
catdoc 0.94 is utilities software developed by Victor Wagner.
catdoc is program which reads one or more Microsoft word files and outputs text, contained insinde them to standard output. Therefore it does same work for .doc files, as unix cat command for plain ASCII files.
It is now accompanied by xls2csv - program which converts Excel spreadsheet into comma-separated value file. Newest addition to catdoc suite is catppt - program, which extracts readable text from the PowerPoint files.
Optionaly, catdoc is able to translate some non-ASCII chars into correspoindig TeX escape sequences and convert charsets from Windows ANSI codepage or unicode to local codepage of target machine.
It also have database of substitution sequences which are used for symbols which are not present in the target encoding. So if you are trying to read Russian word file under C locale, you'll get a transliteration.
Under Unix it uses nl_langinfo function to find out which output encoding to use, under DOS it uses appropriate DOS function, which gets codepage value from the COUNTRY statement in config.sys.
catdoc is also able to read RTF files and even plain text, so it can be used as general-purpose encoding convertor. (Because catdoc is russian program, by default it converts cp1251 to koi8-r, when running under UNIX and to cp866 when running under DOS.
Catdoc has rudimentary table handling. In TeX mode it inserts & when encounters field delimiter and when encounters end of table row. No table headers are produced although.
Catdoc doesn't even try to preserver MS-Word character formatting. It's goal is to extract plain text and allow you to read it and, probably, reformat with TeX, according to TeXnical rules, most Word users haven't even heard about.
xls2csv does roughly same for Excel files. It extracts data and leaves out any formatting info and formulas. Concept is that you want to see data, not the way it was created.
There is tcl/tk GUI script wordview which provides GUI for viewing Word and RTF files using catdoc. Since internal representation of Tcl string is utf-8 and most systems now have unicode fonts, you'll probably be able to read document in any language using this script.
catdoc 0.94 supports english interface languages and works with Linux.
Downloading catdoc 0.94 will take if you use fast ADSL connection.
0 comments
Add to
catdoc Version History
Related Software
|
|
From category: Utilities |
| delsafe 0.3.2 is utilities software developed by Paulo Silva. delsafe is a set of utilities to hopefully allow you to recover recently deleted files. Basically, when you delete or in certain cases... |
|
|
From category: Backup |
| Genie Backup Manager Professional is a very easy to use yet powerful software that can backup and restore files, documents, emails, settings, programs and more to virtually any local or remote device... |
|
|
From category: Utilities |
| KAlarm is a personal alarm message, command and email scheduler.... |
|
|
From category: Backup |
| 1 - Handy Data Recovery
Handy Data Recovery is an easy-to-use data recovery software designed to restore files accidentally deleted from hard disks and floppy drives. The program can recover file... |
|
|
From category: Utilities |
| DDD 3.3.1 is utilities software developed by Andreas Zeller. GNU DDD is a graphical front-end for command-line debuggers such as GDB, JDB, DBX, WDB, Ladebug, XDB, the Perl debugger, the bash debugg... |
|
|
From category: Utilities |
| DidiWiki 0.5 is utilities software developed by Matthew Allum. DidiWiki is a small and simple WikiWikiWeb implementation written in C. Its intended for personal use for notes, Todo\'s etc. It inclu... |
|
|
From category: Utilities |
| DynaStop is a gpl licensed LINUX utility to examine IP4 based addresses for Exim.... |
|
|
From category: Utilities |
| CyberBrau 0.9.4 is utilities software developed by Phil. Cyberbrau is a web based program to help the homebrewer. CyberBrau project allows for very simple and intuitive recipe creation, and it auto... |
|
|
From category: Utilities |
| random is a non-determinisitic random number for GNU R.... |
|
|
From category: Utilities |
| configsaver 0.2 is utilities software developed by Chris AtLee. configsaver will synchronize your personal configuration files (such as ~/.vimrc, ~/.bashrc, etc.) using your gmail account. When... |
|
|
From category: Other-Tools |
| versatile data and function plotting flexible data reading/writing in different formats (including cdf, netcdf, audio, binary, images) reading of images (over 80 image formats) and compres... |
|
|
From category: Linux-Distributions |
| Damn Small is small enough and smart enough to do the following things: -Boot from a business card CD as a live linux distribution (LiveCD). -Boot from a USB pen drive -Boot from within a host opera... |
|
|
From category: Utilities |
| FedUp 0.4 is utilities software developed by Terence Lee. FedUp is a package tracker for tracking shipped packages (i.e. FedEx, DHL, UPS, USPS) Package is written using the mono framework.... |
|
|
From category: Backup |
| Genie Backup Manager is a very easy to use yet powerful software that can backup and restore files, documents, emails, settings, programs and more to virtually storage location, including hard disks,... |
|
|
From category: Utilities |
| All in One 0.5.0 is utilities software developed by Mario Pascucci. All in One is a dock applet for Fluxbox and similar Window Managers. It shows: memory usage: main, buffers, shared... |
Leave a comment