This paper addresses the issue of text normalization,
an important yet often overlooked
problem in natural language processing.
By text normalization, we mean
converting 'informally inputted' text into
the canonical form, by eliminating 'noises'
in the text and detecting paragraph and sentence
boundaries in the text.