Download GutenMark - GutenMark Description, GutenMark Reviews
Contact
 


 

Download

 
Download Now
GPL (GNU General Public License)
Downloads till now: 4
 
 

Quick search

 



 

Rate this software

  • Currently 0/5 Stars.
  • 1
  • 2
  • 3
  • 4
  • 5

No. Votes

0

 

Linux

Emacs , Filters , IDEs , Indexing, Markup , Others , Text Editors , Vim Plugins , Word Processors ,

Windows

Mac

Mobile

Drivers

Scripts - DHTML

Scripts - DHTML (new)

Web Developer Blog

Web Developer Blog (new)

Scripts and Applications

Ajax
ASP
ASP.NET
C and C++
CFML
CGI and Perl
Flash
Java
JavaScript
PHP
Python
XML

GutenMark

 

Details

Last Update: 2008-06-05 21:45:29
Version: 20080601
License/Program Type: GPL (GNU General Public License)
Publisher: Ronald S. Burkey
Price:$0.00
Description:

GutenMark is a Project Gutenberg markup tool.

GutenMark is a command-line tool for automatically creating high-quality HTML or LaTeX markup from Project Gutenberg etexts. As of April 2008, there is also a graphical front-end called GUItenMark that greatly simplifies usage for casual users. Both Windows and Linux 'x86 are supported.



In combination with other freely-available conversion tools GutenMark aims to convert Project Gutenberg etexts into publication-quality Postscript or PDF, for print-on-demand applications. The goal is for this conversion to be completely automatic, without manual markup or editing, but for the forseeable future some manual intervention will almost always be needed—at least, if your standards are at least as high as mine.

What is Project Gutenberg?

Project Gutenberg—or PG for short—is a project for freely providing online books. Thousands of such "etexts" have been made available. Many are familiar classics, and many others are completely unfamiliar books you're unlikely to find anywhere else. I've provided a dozen or so of the etexts myself.

(GutenMark is not affiliated with Project Gutenberg in any way.)

Here are some key features of "GutenMark":
· Tries to deduce the title and author.
· Identifies the Project Gutenberg "fine print" header and, by default, removes it. At your option, it can also retain the header, but does not attempt to reformat it. The header will appear in a fixed-width font, unlike the remainder of the text.
· Usually, a PG etext will begin with items like title pages, tables of contents, notes from the person who created the etext, and so forth. These materials differ in format from etext to etext, and follow no obvious rules. GutenMark, tries to identify this section, which it entitles "Prefatory Materials", and performs only minor reformatting on it.
· Adds "smart quotes".
· Adds headings to chapters, sections, etc.
· Identifies paragraphs, and joins together the lines of the paragraph, so that word wrapping can be used. Paragraphs are right justified, by default.
· Distinguishes word-wrapped areas from verse.
· PG etexts are highly inconsistent in their handling of italicized text. Many etexts simply discard that information. Others mark italicized text in some ways, but that marking differs from etext to etext, or even within a single text. All PG or newsgroup italicizing styles I'm aware of are handled:

· _italicized_
· italicized
· /italicized/
· ~italicized~
· ~~italicized~~
·
· italicized
· _/italicized/_
· _italicized_
· /italicized/
· _/italicized/_
· /:italicized:/
· |:italicized:|
· ITALICIZED

· GutenMark automatically italicizes certain words like "etc.", "viz.", "i.e.", and so on. When wordlists are used, it by default italicizes all words which it can identify as being in a foreign language—i.e., a language other than the native language of the etext—with some exceptions such as proper names.
· When wordlists with built-in soft-hyphens are used (presently, only the Norwegian wordlist), text can be automatically hyphenated when (or if) HTML is converted to Postscript. Or, post-processing software (like html2ps) may be able to use TeX hyphenation files.
· Locates ends of sentences and colons, so that they can be followed by two spaces rather than one. Automatically recognizes that honorifics like "Mr. Smith" aren't ends of sentences, and that sentences may be in quotations. It recognizes that constructs like "929 N. Durello" are not the ends of sentences.
· Handles dangling hyphens at the ends of lines, so that they are not followed by spurious spaces.
· Can usually markup centered lines. (Though Project Gutenberg frowns on centered text, a lot of folks use it anyhow.)
· There are no practical limitations in terms of file sizes.
· Only a minuscule subset of HTML is used, so the marked-up files should have maximum portability.
· Traditionally, PG etexts have used so-called "7-bit" ASCII, but lately a number of "8-bit" ASCII texts have shown up. These 8-bit files more accurately represent the diacritical marks found in non-English texts. For example, 'ü' in an 8-bit etext shows up merely as 'u' in a 7-bit etext. GutenMark is able to handle both.
· GutenMark can also, to some extent, restore the diacritical marks which are not present at all in 7-bit ASCII etexts. For example, if we encounter the word "role" in a 7-bit English-language ASCII text, it will be converted to "rôle".
· LaTeX support has been added, providing an alternative to HTML output.

Requirements:
·

What's New in This Release:

· The GUI front-end now supports the GutenSplit utility in addition to supporting GutenMark itself.
· The GUI front-end now defaults to using the desktop for input and output.
· Mac OS X support has been reinstated, though with full support only for Leopard.
· Limited iPhone support has been added.
· Some portability issues with the Linux version have been fixed.
· GutenSplit has several new options.
· The installer programs have been made somewhat smaller.


Leave a comment




(optional)

What is 7-3?




0 comments


Add to

 Del.icio.us   Digg It   Furl   YahooMyWeb   Blinklist
 

GutenMark Version History

Product Date Added
GutenMark 20080601 2008-06-05 21:45:29


Related Software

Atomsphere 1.0.1.0
From category: Markup
Atomsphere 1.0.1.0 is markup software developed by Bill Brown. Atomsphere is a Java library for creating and modifying Atom 1.0 compliant feed documents. Atomsphere is also bundled with a servlet-b...
Emacs Common Lisp 20061030
From category: Emacs
Emacs Common Lisp 20061030 is emacs software developed by Lars Brinkhoff. Emacs Common Lisp is an implementation of Common Lisp, written in Emacs Lisp. It does not yet purport to conform to the ANS...
Estraier
From category: Indexing
Estraier 1.2.29 is indexing software developed by Mikio Hirabayashi. Estraier is a full-text search system for personal use. Full-text search means functions to search lots of documents for some do...
SiSU
From category: Markup
SiSU (Serialized information, Structured Units) is is a document creation and management framework....
eXe
From category: Markup
eXe 0.19 is markup software developed by The University of Auckland. The eLearning XHTML editor (eXe) is a web-based authoring environment designed to assist teachers and academics in the design, d...
EasyEclipse Expert Java
From category: IDEs
EasyEclipse Expert Java is bare-bones Eclipse distro for experienced Java developers who are new to Eclipse....
Syntext Serna
From category: Markup
Syntext Serna is a highly customizable, multi-platform, pure XSL-driven WYSIWYG XML content editor....
xsd
From category: Markup
xsd is a W3C XML Schema to C++ translator....
Cloc
From category: Filters
Cloc counts blank lines, comment lines, and physical lines of source code in many programming languages....
Fid Emacs
From category: Emacs
Fid Emacs 0.2 is emacs software developed by Jon Cast. Fid Emacs project is an Emacs-like text editor integrated with the Frigand Imperial Desktop. It uses Fid\'s mechanisms for buffers, win...
EPIC
From category: IDEs
EPIC 0.4.0 is ides software developed by Jan Ploski. EPIC is a Perl IDE based on the Eclipse platform. Features supported are syntax highlighting, on-the-fly syntax checking, content assista...
Enca
From category: Others
Enca 1.9 is others software developed by David Necas. Enca detects the encoding of text files, on the basis of knowledge of their language. Enca is an Extremely Naive Charset Analyser. It de...
Beautiful Soup
From category: Markup
Beautiful Soup 3.0.3 is markup software developed by Leonard Richardson. Beautiful Soup project is a Python HTML/XML parser designed for quick turnaround projects like screen-scraping. Three featur...
docbookm
From category: Markup
docbookm 0.2.0 is markup software developed by Robert Bienert. docbookm project is contributed with LayManSys and contains very simple XSLT drivers for generating XHTML chunks from DocBook XML....
Auto-recompile 1.1
From category: Others
Auto-recompile 1.1 is others software developed by Fredrik Hubinette. Auto-recompile is a small emacs add-on that allows you to fix compilation errors faster. It does this by continuously compiling...
 

Top Downloads

 
1. Canon PIXMA iP1000 Printer Driver
2. Canon PIXMA iP1200 Printer Driver x64 d
3. Canon PIXMA iP1200 Printer Driver
4. Realtek ALC/ 262/ 265/ 268/ 660/ 861/ 880/ 882/ 883/ 885/ 888 Audio
5. Canon PIXMA iP1300 Printer Driver a
6. Canon PIXMA MP210 MP Drivers
7. Canon PIXMA iP1600 Printer Driver
8. Canon PIXMA MP160 MP Drivers xp64
9. Canon PIXMA MP160 MP Drivers 9xME
10. Canon PIXMA iP1300 Printer Driver c
11. Asus EZVcr II
12. Canon i-SENSYS LBP2900 Printer Driver R
13. Canon i560 Printer Driver
14. Canon LaserShot LBP-1210 Printer Driver
15. SendSong
16. Realtek RTL8139C(L)+/RTL8139D(L)/RTL8100(L)/RTL8130/RTL8139B(L) Driver
17. Realtek RTL8100B(L)/RTL8100C(L)/RTL8101L/RTL8139C(L) Driver XP
18. Genius Eye 110 Webcam Driver
19. Mercury KPC-6225V-MH
20. Alcatel SpeedTouch 330/USB

DownloadTube Editor Reviews

 
1. Sudoku Solver Software
Sudoku Solver Software is a simple yet smart and reliable to...
2. Easy PC Firewall
WARNING: According to avast! 4.8, Easy PC Firewall contains ...
3. Anti Tracks Kit
Anti Tracks Kit is a simple yet powerful and reliable softwa...
4. PerfectClock Trader Edition
PerfectClock Trader Edition is a FREEWARE, feature limited v...
5. ProLingo Italian to English Dictionary
ProLingo Italian to English is a really nice, easy to use, a...
6. Tinysoar dvd to ipod converter
Tinysoar dvd to ipod converter will allow you to easily copy...
7. Tinysoar ipod value pack
Tinysoar ipod value pack includes the Tinysoar dvd to ipod c...
8. Tinysoar ipod video converter
Tinysoar ipod video converter is a simple to use tool that c...
9. Financial Icon Library
Vista Financial Icon Library is a stunning collection of mon...
10. Tinysoar iphone video converter
Tinysoar iphone video converter is a smart, simple tool that...

Software Reviews Full List



Recent Blog Posts

 
1. Google Chrome – It’s Finally Here. Will A Revolution Begin?
First, it was the rumors. Then, Google announced it official...
2. An Amazing Free Document Processing Software: LyX
The documents management task could be difficult in absence ...
3. DownloadTube Toolbar is Available For Free Download
Recently, we have made available for free download the Dow...
4. A Revolution in Web Browsing: The New Firefox 3.1b1 Already Beats All Speed Records
The latest beta1 release of Mozilla Firefox 3.1 shows majo...
5. Some Little, Nice, Freeware Tools You May Never Know When You'll Need
This time I won’t speak about a single freeware program that...
6. How To Increase The Quality of Your News Articles For Search Engine Spiders
The process of articles publishing is a common practice to...
7. Digg in Press: Tips and Opinions
Regarding Digg social bookmarking service there are many a...
8. Ubuntu Linux and Windows Can Share The Desktop In Absence Of Virtual Machines
Many people asked themselves how to run Ubuntu Linux and W...
9. 2.5 Millions Downloads for FireTune: It Makes Mozilla Firefox To Run With The Speed of Light
It is well known the fact that even the latest version of M...
10. Image Galleries on Autopilot: Instant Gallery Maker
The creation of image galleries ready for web publishing...

Last 20 Scripts

 
1. Ninja Blog
Ninja Blog is a PHP based blogging solution. Based upon word
2. Dragonfly CMS
DragonflyCMS is a content management system based on PHP-Nuk
3. Diferior
Diferior is a flexible, customizable, both user and develope
4. DBHcms
DBHcms is a search engine optimized and lightweight content
5. concrete5
concrete5 content management system could be a rapid solutio
6. bloofoxCMS
bloofoxCMS is a lightweight content management system based
7. PHP Membership
PHP Membership script allows you to add password protection
8. Tube Spider
Tube Spider allows your visitors to search videos in Youtube
9. Azure CMS
Azure CMS is a universal software product for the developmen
10. Azure Portal
Azure Portal is a social networking script made with PHP pro
11. One Frog
One Frog is a content management system that allows you to u
12. Cigmas CMS
Cigmas CMS is a powerful web content management system for g
13. WebWord CMS
WebWord CMS is a full featured web content management system
14. Marjetica Content Management System
Marjetica Content Management System is a powerful, easy to u
15. Phenotype CMS
Phenotype CMS is a PHP/MySQL - Smarty Content Application Fr
16. Chupix CMS
Chupix is a content management system written in PHP and sto
17. Interspire Website Publisher
Interspire Website Publisher (formerly ArticleLive) is a con
18. Interspire Email Marketer
Interspire Email Marketer (formerly SendStudio) is a web bas
19. Comments RAM
Comments RAM is a lightweight PHP script that allows you to
20. KoolAjax
KoolAjax facilitates data exchange between server-side and c