I want to talk to you about OCR

tony1
New Member

Hello app creators. We’re working on a new optical character recognition (OCR) feature, and I would love to hear from you all about how you would use this feature in your apps. You can reply to this thread or shoot me an email at tony@appsheet.com.

@Tony Awesome. I would love to be a beta tester.

Awesome!

tony1
New Member

@Grant_Stead Text-only for now. But that’s a good idea.

Yes, this would be excellent for our workflow. We currently scan a lot of forms containing Code 39, QR, and a text box, and we have to press the scan button multiple times per form. It would be great if we could scan codes and text all at once and use something like regular expressions to match them to the right field. If you need help testing or developing, let me know!
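
A rough sketch of the regular-expression idea, written in Python and not tied to anything AppSheet offers. The field names (`qr_url`, `code39`, `notes`), the assumption that the QR code holds a URL, and the `route_scans` helper are all made up for illustration; only the Code 39 character set (uppercase letters, digits, space and `- . $ / + %`) is standard.

```python
import re
from typing import Dict, Iterable

# Hypothetical field names and patterns, tried in order; first match wins.
FIELD_PATTERNS = [
    ("qr_url", re.compile(r"https?://\S+")),        # assumes the QR code holds a URL
    ("code39", re.compile(r"[A-Z0-9 .$/+%-]+")),    # standard Code 39 character set
    ("notes", re.compile(r".+", re.DOTALL)),        # fallback: free text from the text box
]


def route_scans(values: Iterable[str]) -> Dict[str, str]:
    """Assign each scanned value to the first still-empty field whose pattern fits."""
    fields: Dict[str, str] = {}
    for value in values:
        for name, pattern in FIELD_PATTERNS:
            if name not in fields and pattern.fullmatch(value):
                fields[name] = value
                break
    return fields


scanned = ["https://example.com/form/17", "WO-1234", "Replace filter before winter"]
print(route_scans(scanned))
```

The order of the patterns matters: the catch-all text pattern has to come last, otherwise it would swallow the codes as well.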

This is perfect! I definitely could use this!!!

As a service contractor I take pictures of model and serial numbers every day! Then I squint at the small preview in AppSheet of the model, serial, etc. and manually enter it into the model/serial field.

If you need a beta tester sign me UP!!!

Looks great!

Any word on what tiered plan this feature will be available for?

Cricic
New Member

Is it possible to use OCR with PDF documents as well, or only with images?

@tony Any updates with this?

tony1
New Member

Hi everyone, I’ve added you all to the list of beta testers. This feature should be available to you on Monday afternoon (Seattle time). Please share any feedback or questions you have.

Thanks Tony. I am currently occupied on other stuff but will try to get back to this so I can provide some feedback. Appreciate your efforts to add capabilities like this.

Please @tony could you add 2 accounts for me?
622224
651676
I will test this for 2 clients on Tuesday and Wednesday.

Hi Tony, would it be possible to add me to the list of beta testers as well?

I could see this feature being used to digitize BOMs for our shop and/or contractor on site

@tony, please add Account 323951 to the Beta Tester List as well.

Cool

Thanks Tony,

I am having issues creating an OCR model when I only have one output column (vehicle registration plate). Are you aware of any issues? I have 5 sets of test data, but I am getting an error saying “Unable to train your OCR model”.

Any feedback would be much appreciated.

Thanks!

Would appreciate being a Beta tester too please. Acnt - 689755 and 1001121

sin
New Member

Hi Tony, please add me as well for Beta Tester. Acct# 479408

Hi! That would have many applications in my field of work! Can’t wait to use it!!! Please keep me informed.

Cricic
New Member

Hi Tony, please add me as well for Beta Tester. Acct# 469193

@tony Hi could you add my 129805 account please, thanks.

@tony First: I got it to work. Thank you for that useful tool.

Here are my impressions:

  • If you get the error “Could not train your OCR model, please contact support@appsheet.com”, the example image(s) is/are not good. Start with just one and add the others one by one.
  • The example images have to have a good background. An image like this won’t work.

    It also won’t work if they have too little background, like this one:
    2X_a_a8ea10201df738dcaa6fbb142566594252c0f88b.jpeg
  • After generating an OCR Model: When you add a column to your table and then hit regenerate, it will crash the app. Error: Object reference not set to an instance of an object. In the Columns tab, there will be no columns. It also doesn’t help to delete the table and add it again. You have to generate a new app. Workaround: Before changing the column structure, you have to delete the OCR model.
  • LongText is not working. If you have 2 lines like in my example, you have to separate them into 2 columns. After that you can do a CONCATENATE() in another column to combine those 2 columns.
  • In my example, there is a text field and, in the same line, also a date. Sometimes it separates them correctly into 2 columns. Sometimes it puts everything in 1 column, like FL19/026209 11.07.2019. It seems to depend on the image captured.
  • Date has to be Text type. It won’t work with Date type. Error: This entry is invalid. But perhaps this has to do with our German date format. Workaround: Use another virtual column (VC) to turn the Text column into a Date type with the expression: DATE(MID([DateColumnText],4,2)&"."&LEFT([DateColumnText],2)&"."&RIGHT([DateColumnText],4))
  • Characters like “ü” are sometimes recognized as “u”.
  • In the OCR Model I can see no images:
  • You need to have an internet connection to use the OCR. Am I right?

EDIT:

  • Double quotes are not handled correctly.
    "SZ ""e-line"" MJ would be recognized as
    "SZ""e-line MJ
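
For anyone post-processing the OCR output outside AppSheet, the two date-related points above (reference and date landing in one column, and the German dd.mm.yyyy format) could be handled roughly like this. It is only a Python sketch under assumptions: the helper name `split_ref_and_date` is made up, and it assumes the date always comes last in the line and always has the form dd.mm.yyyy.

```python
import re
from datetime import date, datetime
from typing import Optional, Tuple

# Assumed layout: "<reference> <dd.mm.yyyy>", e.g. "FL19/026209 11.07.2019".
FUSED = re.compile(r"^(?P<ref>.+?)\s+(?P<date>\d{2}\.\d{2}\.\d{4})$")


def split_ref_and_date(raw: str) -> Tuple[str, Optional[date]]:
    """Return (reference, date) if a trailing German-format date is found,
    otherwise (text, None)."""
    text = raw.strip()
    match = FUSED.match(text)
    if match is None:
        return text, None
    try:
        # %d.%m.%Y reads day first, then month, as in the German format.
        parsed = datetime.strptime(match.group("date"), "%d.%m.%Y").date()
    except ValueError:
        # Looks like a date but is not a valid one (e.g. 31.02.2019).
        return text, None
    return match.group("ref"), parsed


print(split_ref_and_date("FL19/026209 11.07.2019"))
print(split_ref_and_date("FL19/026209"))
```

Keeping the OCR column as Text and converting afterwards, as in the workaround above, means a misread date never blocks the form entry.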

Hi Tony
Please add account 808816

tony1
New Member

@Fabian We’re looking into it, thanks.

We’ll add others to our beta list soon.

Hi Tony
Please add account 474850
Thanks

If you are still looking to add some beta testers, would love to take part of this.
Account ID: 865909

11123
New Member

Please add me as well for Beta Tester. Acct# 94164

@tony I can see images now, but no highlighted regions.


In my case there are numbers and decimals. However, I found out that it won’t work well with Decimal and Number columns. I had to use Text columns. Then the image was recognized better.

Tarang
New Member

I would like to use it to read various meters on our premises. Like Electric meter, water meter, etc.

Please add 288391 to beta list.

I am really interested in the progress of this feature. I’ve had multiple people reach out to me wondering if there’s something in the works, and I love to see it being tested.

Thanks to everyone for leaving your feedback and thoughts on your tests here, they really mean a lot and provide great insight!

I can’t wait for this feature to go mainstream.

@tony
I can no longer create a new OCR Model. Error: Could not train your OCR model, please contact support@appsheet.com
Even when I RETRAIN an existing one with the same data, the same Error occurs.

tony1
New Member

@Fabian We’ve deployed some bug fixes to address that issue. Thank you for trying it out. Please give it another try and let me know how it goes.

Also, we’re working to address the issues mentioned above. I’ve updated the beta list too to include a few of you who sent your IDs.

Hi Tony, could I please be added to the beta list too! My ID is 1020775

Thanks!

@tony Now I can retrain again with the same data. Even with more data. Thank you!
But when I create a new Model, it will save the model, but without any result. It will not show any images in the patterns section. I think it hasn’t recognized anything.
I did this with exactly the same data I used before.
In one case it gives the warning:

A warning was produced while training OCR Model 'New OCR Model 2': The OCR Model builder found inconsistencies in the input data for column "CO". In row 0, the value "32" was expected but was not found; ignoring thisfield in this row for now. In row 2, the value "29" was expected but was not found; ignoring thisfield in this row for now. In row 3, the value "22" was expected but was not found; ignoring thisfield in this row for now.

But as you can see, in my data I have those values. But in other rows. Why is it expecting something in row 0?

2X_d_dc338e5ab99eff145e6c71c861ed8ad445bfd7cc.png

appName=OCRWalk-133783

Hey @tony
I would like to test the OCR.
My account is: 144963 THANKS!

@tony I see you are working hard on this feature. It’s getting smoother and smoother. Thank you!

Still having problems with LongText.
2X_b_b230e879be0d361e6657518366467c17c7330961.jpeg

"SZ ""e-line"" MJ 12, Digitaldruck
PU-beschichtet

When I try using a LongText Column it catches only the first line.
So I split it in 2 columns. But then it rarely catches the second line.
Any idea?

appName=OCRKM-133783

Show type Image with a GIF “https://media.giphy.com/media/sSgvbe1m3n93G/giphy.gif” and with the show_if condition AND(ISNOTBLANK([Image]), ISBLANK([Any OCR column]))
This makes it clear to the user that they have to wait a bit.

Smart

@tony I love the UX of Anyline. It shows a box and interacts with the numbers you want to scan. That helps the user know where to point the camera, and it would avoid an “unclean” background.

2X_2_2daaa1d6a6d15a77d522b8e37847408bda092eb2.jpeg
They even offer barcode detection as well.
Could we do something similar with AppSheet?

Here is a demo Video

@Fabian AppSheet already used Scandit for barcode scanning, and it does OCR… Not sure how all this is married up with the machine learning, or the feature sets of each, just throwing it out there…