java opencv检测文字在图片中的位置基于opencv的图片文字识别

转载

doscommand 2024-02-09 07:55:44

文章标签 opencv 计算机视觉 python 人工智能开发语言 文章分类 Java 后端开发

导读

本文将介绍使用OpenCV+OCR实现环形文字识别的详细步骤和代码演示。

背景介绍

光学字符识别(OCR)场景中有很多特殊情况，比如噪声、脏污、倾斜、变形等，都会对识别造成影响。环形文字也是其中一种，我们通常不能直接识别它们，而是先将文字转换到水平方向，再做识别。如下图所示：

java opencv检测文字在图片中的位置基于opencv的图片文字识别_开发语言

如果我们直接识别，很容易识别失败，那怎么办呢？下面来详细介绍上图文字的识别步骤，也可看做环形文字识别的一般步骤。

详细实现步骤

【1】查找定位图中的圆形轮廓。定位圆形可以使用一般的Blob分析，也可使用霍夫圆变换实现，这里因为圆形比较规则且分明，直接使用霍夫圆变换即可，代码和检测效果如下：

gray = cv2.cvtColor(img,cv2.COLOR_BGR2GRAY)
gray = cv2.medianBlur(gray,3)
circles = cv2.HoughCircles(gray,cv2.HOUGH_GRADIENT,1,100,\
         param1=200,param2=30,minRadius=200,maxRadius=300)


isNG = False
if circles is None:
  print("找圆失败！")
  
else:
  circles = np.uint16(np.around(circles))
  a, b, c = circles.shape
  print (circles.shape)
  for i in range(b):
    cv2.circle(img_copy, (circles[0][i][0], circles[0][i][1]), circles[0][i][2], (0, 0, 255), 3, cv2.LINE_AA)
    cv2.circle(img_copy, (circles[0][i][0], circles[0][i][1]), 2, (0, 255, 0), 3, cv2.LINE_AA) # draw center of circle
  cv2.imshow("findCircle", img_copy)

java opencv检测文字在图片中的位置基于opencv的图片文字识别_计算机视觉_02

【2】基于找到的圆做极坐标变换，将文字转换到水平方向。

x = circles[0][i][0] - circles[0][i][2]
y = circles[0][i][1] - circles[0][i][2]
w = h = 2 * circles[0][i][2]
center = (circles[0][i][0], circles[0][i][1])
radius = circles[0][i][2]
C = 2 * math.pi * radius
print(C,radius)


ROI = img[y:y+h,x:x+w].copy()
cv2.imshow('ROI',ROI)
trans_center = (center[0]-x, center[1]-y)
polarImg = cv2.warpPolar(ROI,(int(radius),int(C)),trans_center,radius,cv2.INTER_LINEAR + cv2.WARP_POLAR_LINEAR)
polarImg = cv2.flip(polarImg,1) #镜像
polarImg = cv2.transpose(polarImg)#转置
cv2.imshow('polarImg',polarImg)
cv2.imwrite('polarImg.png',polarImg)

java opencv检测文字在图片中的位置基于opencv的图片文字识别_人工智能_03

【3】文字识别。使用EasyOCR或PaddleOCR均可，这里使用EasyOCR。

result = reader.readtext(polarImg)
print(result)
if len(result) > 0:
  for i in range(0,len(result)):
    print(result[i][1])
    if(result[i][2] < 0.4):
      continue
    for j in range(4):
      if j > 0:
        cv2.line(polarImg,(tuple(result[i][0][j-1])),(tuple(result[i][0][j])),(0,255,0),2,cv2.LINE_AA)
    cv2.line(polarImg,(tuple(result[i][0][0])),(tuple(result[i][0][3])),(0,255,0),2,cv2.LINE_AA)  
    strText = result[i][1].replace(' ','')
    cv2.putText(polarImg,strText,(result[i][0][3][0],result[i][0][3][1]+20),0,0.8,(0,0,255),2)


  cv2.imshow('polarImg-OCR',polarImg)

java opencv检测文字在图片中的位置基于opencv的图片文字识别_开发语言_04

【4】极坐标反变换，将包含识别结果的图像还原成圆形。

polarImg = cv2.flip(polarImg,0) #镜像
polarImg = cv2.transpose(polarImg)#转置
polarImg_Inv = cv2.warpPolar(polarImg,(w,h),trans_center,radius,cv2.INTER_LINEAR + \
                       cv2.WARP_POLAR_LINEAR + cv2.WARP_INVERSE_MAP)
cv2.imshow('polarImg_Inv',polarImg_Inv)

java opencv检测文字在图片中的位置基于opencv的图片文字识别_opencv_05

【5】创建圆形mask图像做copyTo操作，使圆形外部图像与原图保持一致。

mask = np.zeros((h,w,1),np.uint8)
  cv2.circle(mask,trans_center,radius-3,(255,255,255),-1, cv2.LINE_AA)
  cv2.imshow('mask', mask)
  ROI = img[y:y+h,x:x+w]
  for i in range(0,ROI.shape[0]):
    for j in range(0, ROI.shape[1]):
      if mask[i,j] > 0:
        ROI[i,j] = polarImg_Inv[i,j]
  cv2.imshow('result', img)
  cv2.waitKey(0)

java opencv检测文字在图片中的位置基于opencv的图片文字识别_人工智能_06