GutenMark
Details
| Last Update: | 2008-06-05 21:45:29 |
| Version: | 20080601 |
| License/Program Type: | GPL (GNU General Public License) |
| Publisher: | Ronald S. Burkey |
| Price: | $0.00 |
Description:
GutenMark is a Project Gutenberg markup tool.
GutenMark is a command-line tool for automatically creating high-quality HTML or LaTeX markup from Project Gutenberg etexts. As of April 2008, there is also a graphical front-end called GUItenMark that greatly simplifies usage for casual users. Both Windows and Linux 'x86 are supported.
In combination with other freely-available conversion tools GutenMark aims to convert Project Gutenberg etexts into publication-quality Postscript or PDF, for print-on-demand applications. The goal is for this conversion to be completely automatic, without manual markup or editing, but for the forseeable future some manual intervention will almost always be needed—at least, if your standards are at least as high as mine.
What is Project Gutenberg?
Project Gutenberg—or PG for short—is a project for freely providing online books. Thousands of such "etexts" have been made available. Many are familiar classics, and many others are completely unfamiliar books you're unlikely to find anywhere else. I've provided a dozen or so of the etexts myself.
(GutenMark is not affiliated with Project Gutenberg in any way.)
Here are some key features of "GutenMark":
· Tries to deduce the title and author.
· Identifies the Project Gutenberg "fine print" header and, by default, removes it. At your option, it can also retain the header, but does not attempt to reformat it. The header will appear in a fixed-width font, unlike the remainder of the text.
· Usually, a PG etext will begin with items like title pages, tables of contents, notes from the person who created the etext, and so forth. These materials differ in format from etext to etext, and follow no obvious rules. GutenMark, tries to identify this section, which it entitles "Prefatory Materials", and performs only minor reformatting on it.
· Adds "smart quotes".
· Adds headings to chapters, sections, etc.
· Identifies paragraphs, and joins together the lines of the paragraph, so that word wrapping can be used. Paragraphs are right justified, by default.
· Distinguishes word-wrapped areas from verse.
· PG etexts are highly inconsistent in their handling of italicized text. Many etexts simply discard that information. Others mark italicized text in some ways, but that marking differs from etext to etext, or even within a single text. All PG or newsgroup italicizing styles I'm aware of are handled:
· _italicized_
· italicized
· /italicized/
· ~italicized~
· ~~italicized~~
·
· italicized
· _/italicized/_
· _italicized_
· /italicized/
· _/italicized/_
· /:italicized:/
· |:italicized:|
· ITALICIZED
· GutenMark automatically italicizes certain words like "etc.", "viz.", "i.e.", and so on. When wordlists are used, it by default italicizes all words which it can identify as being in a foreign language—i.e., a language other than the native language of the etext—with some exceptions such as proper names.
· When wordlists with built-in soft-hyphens are used (presently, only the Norwegian wordlist), text can be automatically hyphenated when (or if) HTML is converted to Postscript. Or, post-processing software (like html2ps) may be able to use TeX hyphenation files.
· Locates ends of sentences and colons, so that they can be followed by two spaces rather than one. Automatically recognizes that honorifics like "Mr. Smith" aren't ends of sentences, and that sentences may be in quotations. It recognizes that constructs like "929 N. Durello" are not the ends of sentences.
· Handles dangling hyphens at the ends of lines, so that they are not followed by spurious spaces.
· Can usually markup centered lines. (Though Project Gutenberg frowns on centered text, a lot of folks use it anyhow.)
· There are no practical limitations in terms of file sizes.
· Only a minuscule subset of HTML is used, so the marked-up files should have maximum portability.
· Traditionally, PG etexts have used so-called "7-bit" ASCII, but lately a number of "8-bit" ASCII texts have shown up. These 8-bit files more accurately represent the diacritical marks found in non-English texts. For example, 'ü' in an 8-bit etext shows up merely as 'u' in a 7-bit etext. GutenMark is able to handle both.
· GutenMark can also, to some extent, restore the diacritical marks which are not present at all in 7-bit ASCII etexts. For example, if we encounter the word "role" in a 7-bit English-language ASCII text, it will be converted to "rôle".
· LaTeX support has been added, providing an alternative to HTML output.
Requirements:
·
What's New in This Release:
· The GUI front-end now supports the GutenSplit utility in addition to supporting GutenMark itself.
· The GUI front-end now defaults to using the desktop for input and output.
· Mac OS X support has been reinstated, though with full support only for Leopard.
· Limited iPhone support has been added.
· Some portability issues with the Linux version have been fixed.
· GutenSplit has several new options.
· The installer programs have been made somewhat smaller.
0 comments
Downloadtube.com agrees that pirated software hurts software
developers and does not support such practices. We acknowledge and
agree that software piracy is theft. Using GutenMark crack,
GutenMark key generator, GutenMark keygen, as well as any
pirated GutenMark serial number/serial numbers and GutenMark
registration code/codes, warez, keys, torrents, passwords, is illegal
and does not support the future development of GutenMark.
Downloadtube.com advises that GutenMark should be only used in
accordance with the rules of intellectual property and the existing
Criminal Code. Downloadtube.com does not allow the inclusion of any
crack, serial or keygen and disclaims any liability for the
inappropriate use of GutenMark.
Add to
GutenMark Version History
Related Software
|
|
From category: Markup |
| File2XLIFF4j 20061005 is markup software developed by Sonny Zubia. File2XLIFF4j is a modular implementation of a tool that converts files to and from the OASIS standard XLIFF. XLIFF means XM... |
|
|
From category: Others |
| convmv 1.10 is others software developed by Bjorn Jacke. convmv converts filenames (not file content), directories, and even whole filesystems to a different encoding. This comes in very han... |
|
|
From category: Indexing |
| Estraier 1.2.29 is indexing software developed by Mikio Hirabayashi. Estraier is a full-text search system for personal use. Full-text search means functions to search lots of documents for some do... |
|
|
From category: Others |
| KevEdit is a ZZT world editor that is much more powerful than the built in editor.... |
|
|
From category: Vim-Plugins |
| Smartput is a Vim plugin that adjusts spaces and commas when putting text.... |
|
|
From category: Markup |
| itools is a collection of Python libraries which provides a wide range of capabilities.... |
|
|
From category: Markup |
| eXe 0.19 is markup software developed by The University of Auckland. The eLearning XHTML editor (eXe) is a web-based authoring environment designed to assist teachers and academics in the design, d... |
|
|
From category: Others |
| CSSED 0.4.0 is others software developed by ago Rubio. CSSED is a small developer editor and validator, that tries to ease the CSS editing. It is an Open Source project, it means that you can downl... |
|
|
From category: Markup |
| xsd is a W3C XML Schema to C++ translator.... |
|
|
From category: IDEs |
| BlueJ 2.1.3 is ides software developed by Michael Kolling. BlueJ project is an integrated Java environment (Java IDE) specifically designed for introductory teaching. The BlueJ environment w... |
|
|
From category: Vim-Plugins |
| Unicode Macro Table is a Vim script that allows you to easilyr egister insert mode macros for useful Unicode characters.... |
|
|
From category: Markup |
| Beautiful Soup 3.0.3 is markup software developed by Leonard Richardson. Beautiful Soup project is a Python HTML/XML parser designed for quick turnaround projects like screen-scraping. Three featur... |
|
|
From category: Markup |
| EDIReader 3.10 is markup software developed by BerryWorks. EDIReader is a Java package for parsing business documents structured according to EDI (Electronic Data Interchange) standards. It... |
|
|
From category: Markup |
| Apache XMLRPC 3.0 RC1 is markup software developed by Apache Software Foundation. Apache XML-RPC is a Java implementation of XML-RPC, a popular protocol that uses XML over HTTP to implement remote... |
|
|
From category: IDEs |
| Chamo 1.9.17 is ides software developed by Chamo Team. Chamo is a source code editor, even if it can be used to edit any text file. A system of \"views\" allows to edit some kinds of files in speci... |
Leave a comment