Pytesseract, OCR 정확도 향상

debugcn 에 게시 Dev

스실

.NET의 이미지에서 텍스트를 추출하고 싶습니다 python. 이를 위해 pytesseract. 이미지에서 텍스트 추출을 시도했을 때 결과가 만족스럽지 않았습니다. 나는 또한 이것을 거쳐 아래 나열된 모든 기술을 구현했습니다. 그러나 그것은 잘 수행되지 않는 것 같습니다.

영상:

암호:

import pytesseract
import cv2
import numpy as np

img = cv2.imread('D:\\wordsimg.png')

img = cv2.resize(img, None, fx=1.2, fy=1.2, interpolation=cv2.INTER_CUBIC)

img = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)

kernel = np.ones((1,1), np.uint8)
img = cv2.dilate(img, kernel, iterations=1)
img = cv2.erode(img, kernel, iterations=1)

img = cv2.threshold(cv2.medianBlur(img, 3), 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)[1]

pytesseract.pytesseract.tesseract_cmd = 'C:\\Program Files\\Tesseract-OCR\\tesseract.exe'
    
txt = pytesseract.image_to_string(img ,lang = 'eng')

txt = txt[:-1]

txt = txt.replace('\n',' ')

print(txt)

산출:

t hose he large form might light another us should took mountai house n story important went own own thought girl over family look some much ask the under why miss point make mile grow do own school was

원치 않는 공간이 하나라도 많은 비용이들 수 있습니다. 결과가 100 % 정확하기를 바랍니다. 어떤 도움을 주시면 감사하겠습니다. 감사!

Bfris

크기를 1.2에서 2로 변경하고 모든 전처리를 제거했습니다. psm 11 및 psm 12로 좋은 결과를 얻었습니다.

import pytesseract
import cv2
import numpy as np

img = cv2.imread('wavy.png')

#  img = cv2.resize(img, None, fx=1.2, fy=1.2, interpolation=cv2.INTER_CUBIC)
img = cv2.resize(img, None, fx=2, fy=2)

img = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)

kernel = np.ones((1,1), np.uint8)
#  img = cv2.dilate(img, kernel, iterations=1)
#  img = cv2.erode(img, kernel, iterations=1)

#  img = cv2.threshold(cv2.medianBlur(img, 3), 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)[1]

cv2.imwrite('thresh.png', img)

pytesseract.pytesseract.tesseract_cmd = 'C:\\Program Files (x86)\\Tesseract-OCR\\tesseract.exe'
    
for psm in range(6,13+1):
    config = '--oem 3 --psm %d' % psm
    txt = pytesseract.image_to_string(img, config = config, lang='eng')
    print('psm ', psm, ':',txt)

이 config = '--oem 3 --psm %d' % psm줄은 문자열 보간 (%) 연산자 를 %d사용하여 정수 (psm)로 바꿉니다. 정확히 무엇을하는지 모르겠지만 oem사용하는 습관이 생겼습니다. psm이 답변의 끝에서 더 자세히 알아보십시오 .

psm  11 : those he large form might light another us should name

took mountain story important went own own thought girl

over family look some much ask the under why miss point

make mile grow do own school was

psm  12 : those he large form might light another us should name

took mountain story important went own own thought girl

over family look some much ask the under why miss point

make mile grow do own school was

psm페이지 분할 모드의 약자입니다. 다른 모드가 무엇인지 정확히 모르겠습니다. 설명에서 코드가 무엇인지 알 수 있습니다. 목록을 가져올 수 있습니다.tesseract --help-psm

Page segmentation modes:
  0    Orientation and script detection (OSD) only.
  1    Automatic page segmentation with OSD.
  2    Automatic page segmentation, but no OSD, or OCR. (not implemented)
  3    Fully automatic page segmentation, but no OSD. (Default)
  4    Assume a single column of text of variable sizes.
  5    Assume a single uniform block of vertically aligned text.
  6    Assume a single uniform block of text.
  7    Treat the image as a single text line.
  8    Treat the image as a single word.
  9    Treat the image as a single word in a circle.
 10    Treat the image as a single character.
 11    Sparse text. Find as much text as possible in no particular order.
 12    Sparse text with OSD.
 13    Raw line. Treat the image as a single text line,
       bypassing hacks that are Tesseract-specific.

이 기사는 인터넷에서 수집됩니다. 재 인쇄 할 때 출처를 알려주십시오.

침해가 발생한 경우 연락 주시기 바랍니다[email protected] 삭제

에서 수정2021-04-5

몇 마디 만하겠습니다

0리뷰

로그인참여 후 검토

Related 관련 기사

기사

Pytesseract, OCR 정확도 향상

Pytesseract, OCR 정확도 향상

OCR 용 OpenCv pytesseract

OCR 정확도를 향상시키는 방법은 무엇입니까?

Pytesseract-다른 색상의 텍스트가있는 이미지의 OCR

Pytesseract OCR 잘못된 텍스트 인식

CMUSphinx 5 속도 및 정확도 향상

Tensorflow로 테스트 정확도 향상

tesseract ocr 결과의 품질 향상

Tesseract-OCR (3.02) 인식 정확도 및 속도

ANN의 불안정한 출력 값 및 정확도 향상

쿼드를 사용하여 통합 정확도 향상

Keras RNN 정확도는 향상되지 않습니다.

랜덤 포레스트 정확도 향상

2D CNN에서 정확도 향상

WEKA Multilayer Perceptron 모델의 정확도 향상

float 나누기의 정확도 향상 (python)

Java 계산 정확도를 향상시키는 방법

핑 속도 향상

ddply 속도 향상

PyTesseract OCR이 단순 이미지에서 숫자를 읽을 수 없음

Pytesseract (OCR)가이 (쉽게 분명한) 이미지를 인식하지 못함

다중 클래스 이미지 분류기의 정확도 향상

SQL Exists 확장 성 향상

RAID 0 속도 향상

DataTable 반복 속도 향상

계산 속도 향상

REST API 속도 향상

목록 속도 향상

gzip 압축 속도 향상

gzip 압축 속도 향상