Robust table recognition for printed document images

Qiaokang Liang; Jianzhong Peng; Zhengwei Li; Daqi Xie; Wei Sun; Yaonan Wang; Dan Zhang

doi:10.3934/mbe.2020182

Mathematical Biosciences and Engineering

2020, Volume 17, Issue 4: 3203-3223. doi: 10.3934/mbe.2020182


Research article


1. College of Electrical and Information Engineering, Hunan University, Changsha 410082, China
2. National Engineering Laboratory for Robot Vision Perception and Control, Hunan University, Changsha 410082, China
3. Department of Mechanical Engineering, University of Alberta, Edmonton, AB T6G 2R3, Canada
4. Department of Mechanical Engineering, York University, Toronto, ON M3J 1P3, Canada

Received: 19 December 2019 Accepted: 12 April 2020 Published: 23 April 2020

The recognition and analysis of tables in printed document images is an active research area in pattern recognition and image processing. Existing table recognition methods usually require a high degree of regularity in the input, and their robustness still needs significant improvement. This paper presents a robust table recognition system that consists of three main parts: image preprocessing, cell location based on contour mutual exclusion, and recognition of printed Chinese characters based on a deep learning network. A table recognition app has been developed based on the proposed algorithms, which can transform captured images into editable text in real time. The effectiveness of the app has been verified on a dataset of 105 images. The test results show that it identifies high-quality tables well, and that the recognition rate for low-quality tables with distortion and blur reaches 81%, which is considerably higher than that of existing methods. The work in this paper could give insights into the application of table recognition and analysis algorithms.


Citation: Qiaokang Liang, Jianzhong Peng, Zhengwei Li, Daqi Xie, Wei Sun, Yaonan Wang, Dan Zhang. Robust table recognition for printed document images[J]. Mathematical Biosciences and Engineering, 2020, 17(4): 3203-3223. doi: 10.3934/mbe.2020182



1. Introduction

Tables in documents such as product catalogues, balance sheets, and financial reports are important expressive objects that present statistical and relational information. Over the past several decades, Optical Character Recognition (OCR), which converts printed text into editable text, has been widely implemented in applications such as archival literature, office automation, and license plate recognition ^[1]. This technology integrates digital image processing, computer vision, and other disciplines. The rapid development of OCR has promoted the transformation of many industries, since it can significantly save working hours as well as labor costs. However, printed document recognition remains challenging. For example, images obtained by photographing or scanning contain a lot of complicated information, e.g., tables, formulas, images, and a large number of Chinese characters.

General document image character recognition is mainly accomplished by the following steps ^[2]. First, the information of documents in a real scene is captured as an image by photographing or scanning the original paper documents. Second, the layout of the image is analyzed, and the corresponding modules are separated and sent to their respective processors. Third, the functions of the different modules in document character recognition are employed to distinguish and identify the characters in each section. The last two steps play a vital role in document image recognition. Unlike other document recognition technologies, table recognition requires not only extracting the frame and lines of the table, but also obtaining the useful information contained in the table, such as numbers, characters, and formulas.

In printed documents, tables mainly appear in two forms. One is the mixed-type document, which includes pictures, characters, tables, etc. The other is composed only of tables, e.g., financial statements, transcripts, and other single structured tables. The latter is less challenging, since the structure and information of the table can be directly extracted and identified after analyzing the table. The former is more complicated, because the document image has to be preprocessed to minimize noise interference, the table parts have to be extracted, and algorithms have to be applied for identification and analysis. This paper focuses on the recognition of commonly used tables of the former type that are formed by rectangular elements.

A large number of recognition approaches have been proposed in the field of image processing for various recognition tasks. Ranka et al. ^[3] tackled the problem of table detection and retention by proposing a bi-modular approach based on the structural information of tables, including bounding lines, row/column separators, and spaces between columns. In experiments on a dataset of over 600 images containing more than 829 tables, 90% of the tables were detected correctly. Kasar et al. ^[4] presented a query-based approach to selectively extract tabular information and recognize the table structure from scanned documents. The query pattern is first transformed into an attributed relational graph, and a fast graph matching technique is then used to retrieve other similar graphs from the document images. Cuevas ^[5] presented a block-matching algorithm based on harmony search optimization (HS-BM) for motion estimation, which is viewed as an optimization problem whose goal is to find the best-matching block within a search space. The average number of search points visited by the HS-BM algorithm ranges from 9.2 to 17.3, representing 4% and 7.4%, respectively, in comparison to the FSA method. Sage et al. ^[6] proposed a generic method for end-to-end table field extraction that starts with the sequence of document tokens segmented by an OCR engine. The proposed method, a token-level recurrent neural network combining spatial and textual features, outperformed a feedforward network.

As the initial stage of table recognition, the commonly used preprocessing algorithms generally include denoising, image binarization, tilt correction, and perspective correction. In the first step, denoising makes the tables and character information in the images more prominent. The binarization algorithm used in the second step is crucial to the recognition result, since it enhances the foreground components and weakens the background components. Depending on the level of information they consider, binarization algorithms can be generally categorized into global algorithms and local algorithms ^[7]. Global binarization algorithms select a single intensity threshold for the entire grayscale image that separates pixels into two classes, the foreground and the background; the Otsu algorithm, for example, chooses the threshold that maximizes the inter-class intensity variance (equivalently, minimizes the intra-class variance). Typical global binarization algorithms are the Otsu algorithm ^[8] and the iterative method ^[9]. By contrast, local binarization algorithms divide the image into small block units and estimate a different threshold for every pixel according to the grayscale information of its neighboring pixels. Several local binarization methods have been proposed, such as the Niblack ^[10], Sauvola ^[11,12], and Bernsen ^[13] algorithms. Generally, global binarization algorithms perform well with high efficiency for typical scanned table images, while local binarization methods can deal with more difficult table images at the cost of higher computational complexity. It should be mentioned that binarization algorithms have been constantly optimized to adapt to various lighting conditions ^[14]. In the third step, to address the issues caused by tilted or deformed table images, tilt correction algorithms, such as the projection-based and Hough transform-based methods ^[15], or perspective correction algorithms are utilized, respectively. Besides, layout analysis of document images is also implemented in certain scenarios by using the top-down method and the bottom-up method ^[16,17].
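To make the distinction between global and local binarization concrete, the following is a minimal sketch in plain Python, not the implementation used in this paper. Otsu's method is usually stated as maximizing the between-class variance, which is equivalent to minimizing the within-class variance; `otsu_threshold` does this over a 256-bin histogram to obtain one global threshold. `niblack_threshold` computes a per-pixel threshold T = m + k * s from the local mean m and standard deviation s. The function names, the window radius, and the value k = -0.2 are illustrative assumptions.

```python
from math import sqrt


def otsu_threshold(pixels):
    """Global threshold: grey level t in 0..255 maximizing between-class variance.

    `pixels` is a flat iterable of integer grey levels in 0..255.
    Pixels <= t form the low class, pixels > t the high class.
    """
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1
    total = sum(hist)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0    # number of pixels in the low class
    sum0 = 0  # sum of grey levels in the low class
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        w1 = total - w0
        if w0 == 0 or w1 == 0:
            continue  # one class is empty, the variance is undefined
        mu0 = sum0 / w0
        mu1 = (sum_all - sum0) / w1
        # Between-class variance up to the constant factor 1 / total**2.
        between = w0 * w1 * (mu0 - mu1) ** 2
        if between > best_var:
            best_var, best_t = between, t
    return best_t


def niblack_threshold(image, row, col, radius=1, k=-0.2):
    """Local threshold T = mean + k * std over a window clipped to the image.

    `image` is a list of rows of grey levels; radius and k are assumed values.
    """
    height, width = len(image), len(image[0])
    window = [
        image[r][c]
        for r in range(max(0, row - radius), min(height, row + radius + 1))
        for c in range(max(0, col - radius), min(width, col + radius + 1))
    ]
    mean = sum(window) / len(window)
    std = sqrt(sum((v - mean) ** 2 for v in window) / len(window))
    return mean + k * std


# Usage: binarize a tiny two-level "image" with the global threshold.
pixels = [10] * 50 + [200] * 50
t = otsu_threshold(pixels)
binary = [1 if p > t else 0 for p in pixels]
```

The global variant needs one pass over the histogram for the whole image, whereas the local variant recomputes window statistics for every pixel, which illustrates why local methods are computationally more expensive.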

Significant efforts have also been made to develop methods and algorithms for table recognition after the preprocessing of the table images. Extraction methods have been proposed to identify the geometric structures of tables based on different logical relations ^[18]. One of the most extensively used extraction methods is the projection method, which projects the table in the horizontal and vertical directions and obtains the horizontal and vertical line segments, respectively. In addition, a model-based approach has been formed to obtain the characteristics of tables from the topological relationship between table cells ^[19], which makes the result of this approach more accurate and flexible than that of the extraction method. Methods based on formal description languages have also been developed. For example, some methods utilize the LaTeX typesetting system as a language description module to represent the table: LaTeX describes tables by means of a table description language, and the structure of the table is parsed and saved.
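The projection method mentioned above can be sketched as follows. This is an illustrative example under simple assumptions, not the method of any cited work: the input is a binarized image given as a list of rows with 1 for ink and 0 for background, and a row or column is treated as a table line when its ink count covers at least a fraction `min_fill` of the image width or height. The names `projection_profiles`, `line_positions`, and `min_fill` are hypothetical.

```python
def projection_profiles(binary):
    """Return (row_sums, col_sums): the ink count of every row and every column."""
    row_sums = [sum(row) for row in binary]
    col_sums = [sum(col) for col in zip(*binary)]
    return row_sums, col_sums


def line_positions(profile, length, min_fill=0.8):
    """Group consecutive indices whose projection is at least min_fill * length.

    Each returned (start, end) pair is one candidate table line, possibly
    several pixels thick.
    """
    runs, start = [], None
    for i, value in enumerate(profile):
        if value >= min_fill * length:
            if start is None:
                start = i  # a new run of line pixels begins
        elif start is not None:
            runs.append((start, i - 1))
            start = None
    if start is not None:
        runs.append((start, len(profile) - 1))
    return runs


# A 6 x 7 toy table: outer frame, one inner horizontal and one inner vertical line.
grid = [
    [1, 1, 1, 1, 1, 1, 1],
    [1, 0, 0, 1, 0, 0, 1],
    [1, 0, 0, 1, 0, 0, 1],
    [1, 1, 1, 1, 1, 1, 1],
    [1, 0, 0, 1, 0, 0, 1],
    [1, 1, 1, 1, 1, 1, 1],
]
rows, cols = projection_profiles(grid)
horizontal = line_positions(rows, length=len(grid[0]))
vertical = line_positions(cols, length=len(grid))
```

A fixed fill fraction only works when the table is well aligned with the image axes, which is one reason tilt and perspective correction precede line extraction in the preprocessing stage.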

The early classification networks for character recognition were mainly built on AlexNet ^[20] and on ResNet, which was designed by Microsoft ^[21]. With the rapid development of deep learning, especially models such as the convolutional neural network (CNN) ^[22], it has become possible to develop end-to-end character recognition systems. Compared with the early classification networks, recognition algorithms based on deep convolutional networks have strong fault tolerance and classification ability and do not require complicated preprocessing and feature extraction, which significantly reduces the recognition complexity and yields a higher recognition accuracy. The table recognition workflow comprises table image preprocessing, table detection, and text character recognition.

The objective of this paper is therefore to develop an efficient recognition system for table images by integrating advanced algorithms, especially a deep learning framework. The main processing procedures of the proposed table image recognition system are outlined in Figure 1, and the remainder of the paper is organized as follows. Section 2 illustrates the algorithms implemented in the proposed system to preprocess the table images. Section 3 demonstrates the approaches utilized to extract the table lines and locate the table cells. After that, the deep learning framework for recognizing printed characters is proposed in Section 4. By integrating the proposed algorithms and approaches, an Android-based application for table image recognition is developed in Section 5, and its effectiveness is verified with practical tests. Finally, the main conclusions are summarized in Section 6.

Figure 1. Flow chart of the overall table recognition system.

Recognition network	Number of effective recognitions	Recognition rate	Recognition time per piece (s)
ResNet	189	94.5%	2.2
CNN + LSTM	193	96.5%	0.05
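The recognition rates in the table above are consistent with each network being evaluated on 200 test pieces, since 189/200 = 94.5% and 193/200 = 96.5%. The total of 200 is inferred from the reported figures and is not stated in this excerpt. The short sketch below only reproduces that arithmetic; `TOTAL_PIECES` and `recognition_rate` are hypothetical names.

```python
TOTAL_PIECES = 200  # assumption: inferred from the reported rates, not stated in the text


def recognition_rate(effective, total=TOTAL_PIECES):
    """Recognition rate in percent: effective recognitions over all test pieces."""
    return 100.0 * effective / total


# Effective recognition counts as reported in the table.
reported = {"ResNet": 189, "CNN + LSTM": 193}
rates = {name: recognition_rate(count) for name, count in reported.items()}
```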

Method	Inspection object	Description	Accuracy	Application
Fan et al. ^[48] (2015)	Table on PDF files	Apache PDFBox & Stanford NLP toolkit & classifiers (Naive Bayes, Logistic Regression and Support Vector Machine)	0.7948	PDF file
Gilani et al. ^[49] (2017)	Table on document images with varying layouts	Image transformation & deep learning	0.8629	Document & research paper & magazine
Koci et al. ^[50] (2017)	Table structure in spreadsheets	Heuristics-based method	0.78	Spreadsheet
Arif et al. ^[51] (2018)	Tabular regions from document images	Color coding or coloration & Faster R-CNN	0.8964	Document image
*****FineReader	Text document	online recognition-server	0.6850	Text document & document image
The proposed	Table and character on phone images	Image processing & CNN & RNN	0.8667	Natural or unnatural scene image taken by mobile phone

[1]	H. Singh, A. Sachan, A Proposed Approach for Character Recognition Using Document Analysis with OCR, 2018 Second International Conference on Intelligent Computing and Control Systems (ICICCS), 2018, 190-195. Available from: https://ieeexplore.ieee.org/abstract/document/8663011.
[2]	A. M. Sabu, A. S. Das, A Survey on various Optical Character Recognition Techniques, 2018 Conference on Emerging Devices and Smart Systems (ICEDSS), 2018, 152-155. Available from: https://ieeexplore.ieee.org/abstract/document/8544323.
[3]	V. Ranka, S. Patil, S. Patni, T. Raut, K. Mehrotra, M. K. Gupta, Automatic Table Detection and Retention from Scanned Document Images via Analysis of Structural Information, 2017 Fourth International Conference on Image Information Processing (ICIIP), 2017, 244-249. Available from: https://ieeexplore.ieee.org/abstract/document/8313719/.
[4]	T. Kasar, T. K. Bhowmik, A. Belaïd, Table information extraction and structure recognition using query patterns, 2015 13th International Conference on Document Analysis and Recognition (ICDAR), 2015, 1086-1090. Available from: https://ieeexplore.ieee.org/abstract/document/7333928.
[5]	E. Cuevas, Block-matching algorithm based on harmony search optimization for motion estimation, Appl. Intell., 39 (2013), 165-183. doi: 10.1007/s10489-012-0403-7
[6]	C. Sage, A. Aussem, H. Elghazel, V. Eglin, J. Espinas, Recurrent Neural Network Approach for Table Field Extraction in Business Documents, International Conference on Document Analysis and Recognition (ICDAR), 2019. Available from: https://hal.archives-ouvertes.fr/hal-02156269/.
[7]	A. Shrivastava, D. K. Srivastava, A Review on Pixel-Based Binarization of Gray Images, Proceedings of the International Congress on Information and Communication Technology, 2016, 357-364. Available from: https://link.springer.com/chapter/10.1007/978-981-10-0755-2_38.
[8]	A. K. Khambampati, D. Liu, S. K. Konki, K. Y. Kim, An Automatic Detection of the ROI Using Otsu Thresholding in Nonlinear Difference EIT Imaging, IEEE Sens. J., 18 (2018), 5133-5142. doi: 10.1109/JSEN.2018.2828312
[9]	M. Valizadeh, E. Kabir, Partitioning of feature space by iterative classification for degraded document image binarization, IET Image Process., 6 (2012), 804-812. doi: 10.1049/iet-ipr.2011.0399
[10]	L. P. Saxena, Niblack's binarization method and its modifications to real-time applications: A review, Artif. Intell. Rev., 51 (2019), 673-705. doi: 10.1007/s10462-017-9574-2
[11]	M. Kiran, I. Ahmed, N. Khan, A. G. Reddy, Chest X-ray segmentation using Sauvola thresholding and Gaussian derivatives responses, J. Ambient Intell. Humanized Comput., 10 (2019), 4179-4195. doi: 10.1007/s12652-019-01281-7
[12]	Z. Hadjadj, A. Meziane, Y. Cherfa, M. Cheriet, I. Setitra, ISauvola: Improved Sauvola's Algorithm for Document Image Binarization, International Conference on Image Analysis and Recognition, 2016, 737-745. Available from: https://link.springer.com/chapter/10.1007/978-3-319-41501-7_82.
[13]	L. Yang, Q. Feng, The Improvement of Bernsen Binarization Algorithm for QR Code Image, 2018 5th IEEE International Conference on Cloud Computing and Intelligence Systems (CCIS), 2018, 931-934. Available from: https://ieeexplore.ieee.org/abstract/document/8691255.
[14]	I. Pratikakis, K. Zagoris, G. Barlas, B. Gatos, ICFHR2016 Handwritten Document Image Binarization Contest (H-DIBCO 2016), 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR), 2016. Available from: https://ieeexplore.ieee.org/abstract/document/7814134.
[15]	O. Boudraa, W. K. Hidouci, D. Michelucci, Using skeleton and Hough transform variant to correct skew in historical documents, Math. Comput. Simul., 167 (2020), 389-403. doi: 10.1016/j.matcom.2019.05.009
[16]	T. A. Tran, K. Oh, I. S. Na, G. S. Lee, H. J. Yang, S. H. Kim, A robust system for document layout analysis using multilevel homogeneity structure, Expert Syst. Appl., 85 (2017), 99-113. doi: 10.1016/j.eswa.2017.05.030
[17]	J. Ryu, H. I. Koo, N. I. Cho, Word Segmentation Method for Handwritten Documents based on Structured Learning, IEEE Signal Process. Lett., 22 (2015), 1161-1165. doi: 10.1109/LSP.2015.2389852
[18]	A. Riad, C. Sporer, S. S. Bukhari, A. Dengel, Classification and Information Extraction for Complex and Nested Tabular Structures in Images, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), 2017, 1156-1161. Available from: https://ieeexplore.ieee.org/abstract/document/8270122.
[19]	H. T. Tran, T. A. Tran, I. S. Na, S. H. Kim, Cell decomposition for the table in document image based on analysis of texts and lines distribution, 2016 Eighth International Conference on Ubiquitous and Future Networks (ICUFN), 2016, 736-738. Available from: https://ieeexplore.ieee.org/abstract/document/7537135.
[20]	A. Krizhevsky, I. Sutskever, G. E. Hinton, ImageNet classification with deep convolutional neural networks, Advances in Neural Information Processing Systems 25 (NIPS 2012), 2012, 1097-1105. Available from: http://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networ.
[21]	K. He, X. Zhang, S. Ren, J. Sun, Deep residual learning for image recognition, In Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, 770-778. Available from: http://openaccess.thecvf.com/content_cvpr_2016/html/He_Deep_Residual_Learning_CVPR_2016_paper.html.
[22]	Y. Wei, Y. Zhao, C. Lu, S. Wei, L. Liu, Z. Zhu, et al. Cross-Modal Retrieval with CNN Visual Features: A New Baseline, IEEE Trans. Cybern., 47 (2017), 449-460.
[23]	C. Tian, Y. Xu, W. Zuo, Image denoising using deep CNN with batch renormalization, Neural Networks, 121 (2020), 461-473. doi: 10.1016/j.neunet.2019.08.022
[24]	D. Yang, H. Zhou, L. Tang, S. Chen, S. Liu, A License Plate Tilt Correction Algorithm Based on the Character Median Line (Algorithme de correction d'inclinaison de plaque d'immatriculation basé sur la ligne médiane du caractère), Can. J. Electr. Computer Eng., 41 (2018), 145-150.
[25]	Q. An, J. Shi, J. Li, F. Cai, Elevator button recognition using auto-slant correction and projection histogram, 2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI), 2017. Available from: https://ieeexplore.ieee.org/abstract/document/8302054.
[26]	R. Baran, A. Dziech, J. Wassermann, Contour Extraction and Compression Scheme Utilizing Both the Transform and Spatial Image Domains, International Conference on Multimedia Communications, Services and Security, 1-15. Available from: https://link.springer.com/chapter/10.1007/978-3-319-69911-0_1.
[27]	J. Tang, H. Huang, L. Shi, Z. Chen, Y. Lu, H. Chen, An Improved Perspective Transform for Image Distortion Correction, 2018 IEEE International Conference on Consumer Electronics-Taiwan (ICCE-TW), 2018. Available from: https://ieeexplore.ieee.org/abstract/document/8448538/.
[28]	Q. Vien, H. X. Nguyen, B. Barn, X. Tran, On the Perspective Transformation for Efficient Relay Placement in Wireless Multicast Networks, IEEE Commun. Lett., 19 (2015), 275-278. doi: 10.1109/LCOMM.2014.2387163
[29]	A. C. Jalba, M. H. F. Wilkinson, J. B. T. M. Roerdink, Shape representation and recognition through morphological curvature scale spaces, IEEE Trans. Image Process., 15 (2006), 331-341. doi: 10.1109/TIP.2005.860606
[30]	Y. Li, H. Zheng, Z. Yan, L. Chen, Detail preservation and feature refinement for object detection, Neurocomputing, 359 (2019), 209-218. doi: 10.1016/j.neucom.2019.05.086
[31]	M. Naseri, S. Heidari, R. Gheibi, L. Gong, M. A. Raiji, A. Sadri, A novel quantum binary images thinning algorithm: A quantum version of the Hilditch's algorithm, Optik, 131 (2017), 678-686. doi: 10.1016/j.ijleo.2016.11.124
[32]	C. Zhang, W. Zhong, C. Zhang, X. Qin, Simulation Design of Improved OPTA Thinning Algorithm, International Conference on Mechatronics and Intelligent Robotics (ICMIR), 2017, 105-114. Available from: https://link.springer.com/chapter/10.1007/978-3-319-70990-1_15.
[33]	A. K. J. Saudagar, H. V. Mohammed, OpenCV Based Implementation of Zhang-Suen Thinning Algorithm Using Java for Arabic Text Recognition, Information Systems Design and Intelligent Applications, 2016, 265-271. Available from: https://link.springer.com/chapter/10.1007/978-81-322-2757-1_27.
[34]	X. Shi, Y. Huang, Y. Liu, Text on Oracle rubbing segmentation method based on connected domain, 2016 IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC), 2016, 414-418. Available from: https://ieeexplore.ieee.org/abstract/document/7867245.
[35]	Y. Sun, Z. Guo, W. Qiu, Research on the Handwriting Character Recognition Technology Based on the Image Statistical Characteristics, International Conference on Geo-Spatial Knowledge and Intelligence, 2018, 13-20. Available from: https://link.springer.com/chapter/10.1007/978-981-13-0896-3_2.
[36]	A. K. Sharma, P. Thakkar, D. M. Adhyaru, T. H. Zaveri, Handwritten Gujarati Character Recognition Using Structural Decomposition Technique, Pattern Recognit. Image Anal., 29 (2019), 325-338. doi: 10.1134/S1054661819010061
[37]	M. D. Zeiler, R. Fergus, Visualizing and understanding convolutional networks, European Conference on Computer Vision. Cham, Switzerland: Springer International Publishing AG, 2014, 818-833. Available from: https://link.springer.com/chapter/10.1007/978-3-319-10590-1_53.
[38]	K. Simonyan, A. Zisserman, Very deep convolutional networks for large-scale image recognition, arXiv preprint arXiv: 1409.1556, 2014.
[39]	C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, Going deeper with convolutions, In Proceedings of the IEEE conference on computer vision and pattern recognition, 2015, 1-9. Available from: https://www.cv-foundation.org/openaccess/content_cvpr_2015/html/Szegedy_Going_Deeper_With_2015_CVPR_paper.html.
[40]	G. Huang, Z. Liu, L. Van Der Maaten, K. Q. Weinberger, Densely connected convolutional networks, In Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, 4700-4708. Available from: http://openaccess.thecvf.com/content_cvpr_2017/html/Huang_Densely_Connected_Convolutional_CVPR_2017_paper.html.
[41]	N. K. Manaswi, Deep Learning with Applications Using Python, Springer, (2018), 115-126.
[42]	J. Chung, C. Gulcehre, K. Cho, Y. Bengio, Empirical evaluation of gated recurrent neural networks on sequence modeling, arXiv: 1412.3555, 2014.
[43]	J. Chung, S. Ahn, Y. Bengio, Hierarchical multiscale recurrent neural networks, arXiv: 1609.01704, 2016.
[44]	G. Liu, J. Guo, Bidirectional LSTM with attention mechanism and convolutional layer for text classification, Neurocomputing, 337 (2019), 325-338. doi: 10.1016/j.neucom.2019.01.078
[45]	Y. Bengio, P. Simard, P. Frasconi, Learning long-term dependencies with gradient descent is difficult, IEEE Trans. Neural Networks, 5 (1994), 157-166. doi: 10.1109/72.279181
[46]	(CRNN) Chinese Characters Recognition, 2020. Available from: https://github.com/Sierkinhane/crnn_chinese_characters_rec.
[47]	S. Ruder, An overview of gradient descent optimization algorithms, 2016. Available from: http://sebastianruder.com/optimizing-gradient-descent/index.html.
[48]	M. Fan, D. S. Kim, Detecting Table Region in PDF Documents Using Distant Supervision, arXiv: 1506.08891, 2015.
[49]	A. Gilani, S. R. Qasim, I. Malik, F. Shafait, Table Detection Using Deep Learning, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), 2017, 771-776. Available from: https://ieeexplore.ieee.org/abstract/document/8270062.
[50]	E. Koci, M. Thiele, O. Romero, W. Lehner, Table Identification and Reconstruction in Spreadsheets, International Conference on Advanced Information Systems Engineering (CAiSE), 2017, 527-541. Available from: https://link.springer.com/chapter/10.1007/978-3-319-59536-8_33.
[51]	S. Arif, F. Shafait, Table Detection in Document Images using Foreground and Background Features, Digital Image Computing: Techniques and Applications (DICTA), 2018. Available from: https://ieeexplore.ieee.org/abstract/document/8615795.


