《電子技術(shù)應(yīng)用》
您所在的位置:首頁 > 其他 > 設(shè)計(jì)應(yīng)用 > 基于分割的自然場(chǎng)景下文本檢測(cè)方法與應(yīng)用
基于分割的自然場(chǎng)景下文本檢測(cè)方法與應(yīng)用
2021年電子技術(shù)應(yīng)用第2期
陳小順,王良君
江蘇大學(xué) 計(jì)算機(jī)科學(xué)與通信工程學(xué)院,江蘇 鎮(zhèn)江212013
摘要: 自然場(chǎng)景文本檢測(cè)識(shí)別在智能設(shè)備中應(yīng)用廣泛,而對(duì)文本識(shí)別的第一步則是對(duì)文本進(jìn)行精確的定位檢測(cè)。對(duì)于現(xiàn)有像素分割方法PixelLink中存在的彎曲文本定位包含過多背景信息、檢測(cè)圖像后處理不足兩個(gè)主要問題提出改進(jìn)。引入特征通道注意力機(jī)制,關(guān)注生成特征圖中特征通道間的權(quán)重關(guān)系,提升檢測(cè)方法的魯棒性。接著改變公開數(shù)據(jù)集標(biāo)注形式,將坐標(biāo)點(diǎn)表示為一串帶有方向的序列形式,在LSTM模型中進(jìn)行多邊形框的學(xué)習(xí)與框定。最后在公開數(shù)據(jù)集和自建數(shù)據(jù)集上進(jìn)行文本檢測(cè)測(cè)試。實(shí)驗(yàn)表明,改進(jìn)的檢測(cè)方法在各數(shù)據(jù)集中表現(xiàn)優(yōu)于原方法,與當(dāng)前領(lǐng)先方法精度相近,能夠在各個(gè)環(huán)境中完成對(duì)文本的檢測(cè)功能。
中圖分類號(hào): TN911.73;TP391.4
文獻(xiàn)標(biāo)識(shí)碼: A
DOI:10.16157/j.issn.0258-7998.200316
中文引用格式: 陳小順,王良君. 基于分割的自然場(chǎng)景下文本檢測(cè)方法與應(yīng)用[J].電子技術(shù)應(yīng)用,2021,47(2):54-57.
英文引用格式: Chen Xiaoshun,Wang Liangjun. Text detection and application in natural scene based on segmentation[J]. Application of Electronic Technique,2021,47(2):54-57.
Text detection and application in natural scene based on segmentation
Chen Xiaoshun,Wang Liangjun
School of Computer Science and Telecommunication Engineering, Jiangsu University,Zhenjiang 212013,China
Abstract: Text recognition in nature scene is currently applied in various intelligence equipment. The first step of text recognition is to precisely locate the text. In the Pixel Link text location methods, there are mainly two problems: too much background information is incorporated in the text region, and the test accuracy is insufficient. Aiming at these issues, an improved text location method was proposed to precisely locate the text in the natural scene. At first, an attention mechanism was incorporated into the original network. By focusing on the weight relationship between feature channels in the generated feature map, one can improve the weight coefficient of effective feature channels, and suppress the weight of inefficient or invalid feature channels. In the second, by changing the form of data set annotation, the coordinate points can be expressed as a series of sequence forms, so that the text lines can be framed adaptively in the LSTM model. At last, the located object is rotated according to the angle between a pair of vertexes in the polygon frame, and is subsequently fed to the text recognition interface to obtain the final character. Finally, the text detection test is carried out on the open data set and self-built data set. The experimental results show that the improved detection method is superior to the original method on different dataset, and the accuracy is similar to the current leading method.
Key words : pixel segmentation;attention mechanism;LSTM;natural scene text detection

0 引言

    視覺圖像是人們獲取外界信息的主要來源,文本則是對(duì)事物的一種凝練描述,人通過眼睛捕獲文本獲取信息,機(jī)器設(shè)備的眼睛則是冰冷的攝像頭。如何讓機(jī)器設(shè)備從拍照獲取的圖像中準(zhǔn)確檢測(cè)識(shí)別文本信息逐漸為各界學(xué)者關(guān)注。

    現(xiàn)代文本檢測(cè)方法多為基于深度學(xué)習(xí)的方法,主要分為基于候選框和基于像素分割的兩種形式。本文選擇基于像素分割的深度學(xué)習(xí)模型作為文本檢測(cè)識(shí)別的主要研究方向,能夠同時(shí)滿足對(duì)自然場(chǎng)景文本的精確檢測(cè),又能保證后續(xù)設(shè)備功能(如語義分析等功能)的拓展。




本文詳細(xì)內(nèi)容請(qǐng)下載:http://m.xxav2194.com/resource/share/2000003385




作者信息:

陳小順,王良君

(江蘇大學(xué) 計(jì)算機(jī)科學(xué)與通信工程學(xué)院,江蘇 鎮(zhèn)江212013)

此內(nèi)容為AET網(wǎng)站原創(chuàng),未經(jīng)授權(quán)禁止轉(zhuǎn)載。
主站蜘蛛池模板: 91精品久久久久| 久久国内精品自在自线400部o| 美女黄色一级毛片| 国产精品国三级国产aⅴ| 一本色道久久88—综合亚洲精品| 暖暖免费高清日本一区二区三区| 亚洲综合校园春色| 美女免费视频黄的| 国产成人v爽在线免播放观看| 99久久99这里只有免费费精品| 把腿扒开做爽爽视频在线看| 亚洲AV无码精品蜜桃| 污污网站免费观看| 又粗又硬又大又爽免费观看 | 99久久99久久精品国产片果冻| 插插插综合视频| 九九精品视频在线| 永久黄色免费网站| 日韩一区二区三区免费体验| 亚洲视频天天射| 老太脱裤让老头玩ⅹxxxx| 天堂网中文字幕| 久久99国产精品视频| 欧美www在线观看| 亚洲色av性色在线观无码| 美女扒开尿口给男人爽免费视频 | 国产精品亚洲欧美日韩区| caoporn成人| 最近的中文字幕国语电影直播| 亲密爱人完整版在线观看韩剧| 激情五月亚洲色图| 成人在线播放av| 久久精品国产亚洲av日韩| 欧美日韩国产人成在线观看| 免费国产a国产片高清网站| 色偷偷一区二区无码视频| 国内精品区一区二区三| 一级毛片一级毛片一级级毛片| 欧美国产日本高清不卡| 伊人影院在线视频| 美女免费网站xx美女女女女女女bbbbbb毛片|