Rotation invariant texture features and their use in automatic script identification 论文

1998IEEE Transactions on Pattern Analysis and Machine Intelligence引用 280
Handwritten Text Recognition TechniquesImage Processing and 3D ReconstructionImage Retrieval and Classification Techniques

详细信息

发表期刊/会议
IEEE Transactions on Pattern Analysis and Machine Intelligence
发表日期
1998-07-01
发表年份
1998

关键词

Handwritten Text Recognition TechniquesImage Processing and 3D ReconstructionImage Retrieval and Classification Techniques

摘要

Concerns the extraction of rotation invariant texture features and the use of such features in script identification from document images. Rotation invariant texture features are computed based on an extension of the popular multi-channel Gabor filtering technique, and their effectiveness is tested with 300 randomly rotated samples of 15 Brodatz textures. These features are then used in an attempt to solve a practical but hitherto mostly overlooked problem in document image processing-the identification of the script of a machine printed document. Automatic script and language recognition is an essential front-end process for the efficient and correct use of OCR and language translation products in a multilingual environment. Six languages (Chinese, English, Greek, Russian, Persian, and Malayalam) are chosen to demonstrate the potential of such a texture-based approach in script identification.