Consent

This site uses third party services that need your consent. Learn more

Skip to content

Blog

AI for transcribing text on screenshots

Headshot of Craig.

Craig Wright

Now and again, I’ll get a screenshot that has some text on it that I’d like to use. In the old days, I’d have had to type it out manually - BORING!!! But now that AI is upon us, it can be done really quickly and easily. The only downside is that on a free ChatGPT account, there is a limit on how many files you can do per day.

All you need to do is:

  1. Enter a prompt like:

    “can you read the text on an image and reproduce the text?”

    The AI will answer and ask you to upload the image.

  2. Upload the image and voila! The AI gives you the text from the image.

  3. Copy and paste the text into whatever app you need it for. Edit the text if necessary.

Examples of how I’ve used AI transcription

Here are some examples of how I’ve used AI transcription on projects:

  • Improving UI text and descriptions

    Sometimes, the developers I work with enter the most confusing, jargon-riddled, or just plain overlong feature names and descriptions. When I’ve figured out what the feature actually does, I’ll suggest improvements (UX writing!). For that, I like to include the original text and then my version. I take a quick screenshot, upload it to Chat GPT and ask it to give me the text. Copy and paste into the JIRA ticket and it’s job done.

  • Alt text for images

    This is a less common use, but sometimes I’ll have images that contain text and that text would be useful in the alt text description. For example, I’ve used AI to transcribe social media comments in screenshots and to get the text from callouts on diagrams.

  • Getting text from lost documents

    Let’s say I have a screenshot of a document in a knowledge base article. I need to update the screenshot, but I’ve lost the original document. This is where AI can make it easier to recreate the document - I ask it to transcribe the screenshot, then copy and paste the text into a new document. Hey presto! I have quickly remade the document and can take a new screenshot of it. And actually save the document somewhere safe this time.

Posted under ChatGPT

Last modified: 15 June 2024

Headshot of Craig.

Craig Wright is an experienced technical writer based in Chesterfield, UK.  He hates writing about himself in the third person, so I shall stop now.

Always interested in new content writing opportunities. Remote working preferred.

Get in touch

Let’s talk about your project

I’m here to help and offer my expertise as a technical writer. Get in touch and let me know what you need.