
OCR material

bear_fish
Published 2018-09-19 12:41:52

Papers

End-to-End Text Recognition with Convolutional Neural Networks

  • paper: http://www.cs.stanford.edu/~acoates/papers/wangwucoatesng_icpr2012.pdf
  • PhD thesis: http://cs.stanford.edu/people/dwu4/HonorThesis.pdf

Word Spotting and Recognition with Embedded Attributes

  • paper: http://ieeexplore.ieee.org.sci-hub.org/xpl/articleDetails.jsp?arnumber=6857995&filter%3DAND%28p_IS_Number%3A6940341%29

Reading Text in the Wild with Convolutional Neural Networks

  • arxiv: http://arxiv.org/abs/1412.1842
  • homepage: http://www.robots.ox.ac.uk/~vgg/publications/2016/Jaderberg16/
  • demo: http://zeus.robots.ox.ac.uk/textsearch/#/search/
  • code: http://www.robots.ox.ac.uk/~vgg/research/text/

Deep structured output learning for unconstrained text recognition

  • intro: “propose an architecture consisting of a character sequence CNN and an N-gram encoding CNN which act on an input image in parallel and whose outputs are utilized along with a CRF model to recognize the text content present within the image.”
  • arxiv: http://arxiv.org/abs/1412.5903

Deep Features for Text Spotting

Reading Scene Text in Deep Convolutional Sequences

  • arxiv: http://arxiv.org/abs/1506.04395

DeepFont

DeepFont: Identify Your Font from An Image

  • arxiv: http://arxiv.org/abs/1507.03196

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

Recursive Recurrent Nets with Attention Modeling for OCR in the Wild

  • arxiv: http://arxiv.org/abs/1603.03101

Writer-independent Feature Learning for Offline Signature Verification using Deep Convolutional Neural Networks

  • arxiv: http://arxiv.org/abs/1604.00974

DeepText

DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images

  • arxiv: http://arxiv.org/abs/1605.07314

End-to-End Interpretation of the French Street Name Signs Dataset

End-to-End Subtitle Detection and Recognition for Videos in East Asian Languages via CNN Ensemble with Near-Human-Level Performance

Smart Library: Identifying Books in a Library using Richly Supervised Deep Scene Text Reading

Text Detection

Object Proposals for Text Extraction in the Wild

Text-Attentional Convolutional Neural Networks for Scene Text Detection

  • arxiv: http://arxiv.org/abs/1510.03283

Accurate Text Localization in Natural Image with Cascaded Convolutional Text Network

  • arxiv: http://arxiv.org/abs/1603.09423

Synthetic Data for Text Localisation in Natural Images

  • intro: CVPR 2016
  • project page: http://www.robots.ox.ac.uk/~vgg/data/scenetext/
  • arxiv: http://arxiv.org/abs/1604.06646
  • paper: http://www.robots.ox.ac.uk/~vgg/data/scenetext/gupta16.pdf
  • github: https://github.com/ankush-me/SynthText

Scene Text Detection via Holistic, Multi-Channel Prediction

  • arxiv: http://arxiv.org/abs/1606.09002

Detecting Text in Natural Image with Connectionist Text Proposal Network

TextBoxes: A Fast Text Detector with a Single Deep Neural Network

Text Recognition

Sequence to sequence learning for unconstrained scene text recognition

  • intro: master's thesis
  • arxiv: http://arxiv.org/abs/1607.06125

Drawing and Recognizing Chinese Characters with Recurrent Neural Network

Learning Spatial-Semantic Context with Fully Convolutional Recurrent Network for Online Handwritten Chinese Text Recognition

Stroke Sequence-Dependent Deep Convolutional Neural Network for Online Handwritten Chinese Character Recognition

Breaking Captcha

Using deep learning to break a Captcha system

Breaking reddit captcha with 96% accuracy

I’m not a human: Breaking the Google reCAPTCHA

Neural Net CAPTCHA Cracker

Recurrent neural networks for decoding CAPTCHAS

Reading irctc captchas with 95% accuracy using deep learning

End-to-End OCR: A CNN-Based Implementation (端到端的OCR:基于CNN的实现)

  • blog: http://blog.xlvector.net/2016-05/mxnet-ocr-cnn/

I Am Robot: (Deep) Learning to Break Semantic Image CAPTCHAs

  • intro: automatically solves 70.78% of the image reCAPTCHA challenges, requiring only 19 seconds per challenge; applied to the Facebook image captcha, it achieves an accuracy of 83.5%
  • paper: http://www.cs.columbia.edu/~polakis/papers/sivakorn_eurosp16.pdf

Handwritten Recognition

High Performance Offline Handwritten Chinese Character Recognition Using GoogLeNet and Directional Feature Maps

Recognize your handwritten numbers

  • blog: https://medium.com/@o.kroeger/recognize-your-handwritten-numbers-3f007cbe46ff#.jllz62xgu

Handwritten Digit Recognition using Convolutional Neural Networks in Python with Keras

  • blog: http://machinelearningmastery.com/handwritten-digit-recognition-using-convolutional-neural-networks-python-keras/

MNIST Handwritten Digit Classifier

How to Recognize a Handwritten Digit Dataset with a Convolutional Neural Network (CNN)? (如何用卷积神经网络CNN识别手写数字集?)

  • blog: http://www.cnblogs.com/charlotte77/p/5671136.html

LeNet – Convolutional Neural Network in Python

  • blog: http://www.pyimagesearch.com/2016/08/01/lenet-convolutional-neural-network-in-python/

Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTM Attention

  • arxiv: http://arxiv.org/abs/1604.03286

MLPaint: the Real-Time Handwritten Digit Recognizer

Training a Computer to Recognize Your Handwriting

  • blog: https://medium.com/@annalyzin/training-a-computer-to-recognize-your-handwriting-24b808fb584#.gd4pb9jk2

Using TensorFlow to create your own handwriting recognition engine

Building a Deep Handwritten Digits Classifier using Microsoft Cognitive Toolkit

Plate Recognition

Reading Car License Plates Using Deep Convolutional Neural Networks and LSTMs

  • arxiv: http://arxiv.org/abs/1601.05610

Number plate recognition with Tensorflow

end-to-end-for-plate-recognition

Blogs

Applying OCR Technology for Receipt Recognition

  • blog: http://rnd.azoft.com/applying-ocr-technology-receipt-recognition/
  • mirror: http://pan.baidu.com/s/1qXQBQiC

Hacking MNIST in 30 lines of Python

Projects

ocropy: Python-based tools for document analysis and OCR

Extracting text from an image using Ocropus

  • blog: http://www.danvk.org/2015/01/09/extracting-text-from-an-image-using-ocropus.html

CLSTM : A small C++ implementation of LSTM networks, focused on OCR

caffe-ocr: OCR with caffe deep learning framework

Digit Recognition via CNN: digital meter numbers detection

Attention-OCR: Visual Attention based OCR

umaru: An OCR system based on Torch that uses LSTM/GRU-RNN and CTC, with reference to the work of rnnlib and clstm

Tesseract.js: Pure Javascript OCR for 62 Languages

DeepHCCR: Offline Handwritten Chinese Character Recognition based on GoogLeNet and AlexNet (With CaffeModel)

Datasets

COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images

  • homepage: http://vision.cornell.edu/se3/coco-text/
  • arxiv: http://arxiv.org/abs/1601.07140

Videos

LSTMs for OCR

Resources

Scene Text Localization & Recognition Resources

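This article is part of the Tencent Cloud self-media syndication program and is shared from the author's personal site/blog.
Originally published: 2017-01-11. In case of infringement, please contact cloudcommunity@tencent.com for removal.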
