Skip to content →

The Idea Place Posts

ChatGPT Needs to Learn More on Screen Readers, Critical Factual Errors Are Present

Last week a colleague from my days at Microsoft wrote an excellent analysis of the current functionality using ChatGPT and a basic question in HTML. I had been dabbling with questions from time to time around accessibility and ChatGPT myself but hadn’t written anything formal up. Now I have.

This evening with a bit of time, I decided to see what the AI technology knew about reading web pages with several leading screen readers. The results confirmed a comment I had made in response to my colleague’s article where I said there was a world of difference between writing well, which ChatGPT does, and writing accurately, which ChatGPT certainly does not due when it comes to reading web pages with a screen reader.

My basic question was as follows:

How do you read a web page using the <name of screen reader>?

Part of what’s key with any AI model is the content that is used for training. I suspect ChatGPT has scoured more content talking about the JAWS screen reader than any other. Answers for not only JAWS but also NVDA and Narrator asserted I should use the Virtual Cursor to read the web page. Clearly there is some model training bias going on in these answers.

ChatGPT was 100% incorrect in instructions for starting some screen readers. The technology instructed me to press the Insert key on my keyboard to start NVDA and Insert+j for JAWS. It was equally wrong on stopping these screen readers also.

In the case of VoiceOver on the iPhone and TalkBack, I guess ChatGPT wanted to make sure I really had the screen reader running. One of the first steps in both cases was to go into settings and then turn the screen reader on. Then further on, after loading the web page, I was given another set of instructions to start the screen reader.

Using the gestures you are told to use for VoiceOver, you wouldn’t be doing much reading but rather a lot of nothing. ChatGPT instructs you to use three-finger swipes to read the web page. This is in fact VoiceOver’s scroll gesture and there is nothing to scroll horizontally on web pages and it is certainly not how you would read.

It is also interesting that for the Windows screen readers, Internet Explorer is still one of the browsers mentioned. Additionally, Edge gets no attention at all.

Anyone familiar with the basic task of reading a web page with a screen reader will recognize other factual errors here.

Details matter! As these sorts of technologies are deployed for broad societal use, it is critical that those deploying the technology recognize this and work not only to eliminate factual errors from making it into the environment but also have ways for immediate correction when such are found.

I understand this is a preview and all but it is critical that such technology is scrutinized. Again, writing well is a far cry from writing accurately and in fact doing the first can compound errors in the second.

Here are the full responses for my questions to ChatGPT.

JAWS

JAWS is a popular screen reader that helps visually impaired users navigate and interact with web pages. Here are the basic steps to read a web page using JAWS

  1. Open the web page: Use your keyboard to navigate to the web address bar and type in the URL of the page you want to read.
  2. Start JAWS: Launch JAWS by pressing Insert + J on your keyboard.
  3. Navigate with the cursor: Use the arrow keys on your keyboard to move the virtual cursor up and down the page. JAWS will read the content of the page as you move the cursor.
  4. Interact with headings: Press the H key to move from one heading to the next. This can be useful for quickly navigating to different sections of a page.
  5. Use quick navigation keys: JAWS provides several quick navigation keys that allow you to move to specific types of content, such as headings, links, and form elements. For example, press the T key to move to the next table, or the B key to move to the next button.
  6. Read specific elements: Use the virtual cursor to move to a specific element, such as a link or a form field, and then press the Enter key to activate it.
  7. Read context-sensitive help: Press the Insert key followed by the Question Mark key to access JAWS’ context-sensitive help, which provides information on how to interact with the current element.
  8. Stop JAWS: Press Insert + Q to stop JAWS.

It’s important to note that JAWS provides many other features and keyboard commands that can help you navigate and interact with web pages more efficiently. You can consult the JAWS User Guide or attend a JAWS training course for more information on these advanced features.

VoiceOver

VoiceOver is the built-in screen reader for iOS devices, including the iPhone. Here’s how to use VoiceOver to read a web page:

  1. Enable VoiceOver: Go to Settings > General > Accessibility, and turn on the VoiceOver switch.
  2. Launch Safari: Open the Safari app on your iPhone.
  3. Navigate to a web page: Type in the URL of the web page you want to read or use a search engine to find the page.
  4. Start VoiceOver: Tap the Home button three times quickly to start VoiceOver.
  5. Read the page: Swipe left or right with three fingers to move from one item to the next on the page. VoiceOver will read the text or describe the content as you swipe.
  6. Interact with elements: Double-tap with one finger to activate a link or a button. VoiceOver will announce the action and navigate to the next page if you activated a link.
  7. Navigate headings: Swipe up or down with three fingers to move through headings, tables, or other structural elements on the page.
  8. Read specific elements: Touch and hold an element with one finger to hear more information about it, such as the URL of a link.
  9. Stop VoiceOver: Tap the Home button three times quickly to stop VoiceOver.

It’s important to note that VoiceOver provides many other features and gestures that can help you navigate and interact with web pages more efficiently. You can access the VoiceOver User Guide on your iPhone or attend a VoiceOver training course for more information on these advanced features.

Narrator

Narrator is a screen reader built into Windows operating systems. Here’s how to use Narrator to read a web page:

  1. Enable Narrator: Press Windows + Ctrl + Enter to turn on Narrator.
  2. Launch a web browser: Open Internet Explorer, Google Chrome, or another web browser on your computer.
  3. Navigate to a web page: Type in the URL of the web page you want to read or use a search engine to find the page.
  4. Start reading: Press the Caps Lock key to start reading the page. Narrator will read the text of the page as you move the cursor with your mouse or the arrow keys on your keyboard.
  5. Interact with elements: Use the Tab key to move from one element to the next on the page. Press the Enter key to activate a link or a button.
  6. Navigate headings: Press the H key to move from one heading to the next on the page.
  7. Read specific elements: Use the virtual cursor to move to a specific element, such as a link or a form field, and then press the Enter key to activate it.
  8. Stop Narrator: Press Windows + Ctrl + Enter to turn off Narrator.

It’s important to note that Narrator provides many other features and keyboard commands that can help you navigate and interact with web pages more efficiently. You can consult the Narrator User Guide or attend a Narrator training course for more information on these advanced features.

NVDA

NVDA (NonVisual Desktop Access) is a free and open-source screen reader for Windows operating systems. Here’s how to use NVDA to read a web page:

  1. Download and install NVDA: Download the NVDA installer from the official NVDA website and follow the instructions to install it on your computer.
  2. Launch a web browser: Open Internet Explorer, Google Chrome, or another web browser on your computer.
  3. Navigate to a web page: Type in the URL of the web page you want to read or use a search engine to find the page.
  4. Start NVDA: Press the Insert key on your keyboard to start NVDA.
  5. Read the page: Use the arrow keys on your keyboard to move the virtual cursor up and down the page. NVDA will read the text of the page as you move the cursor.
  6. Interact with elements: Use the Tab key to move from one element to the next on the page. Press the Enter key to activate a link or a button.
  7. Navigate headings: Press the H key to move from one heading to the next on the page.
  8. Read specific elements: Use the virtual cursor to move to a specific element, such as a link or a form field, and then press the Enter key to activate it.
  9. Stop NVDA: Press the Insert key on your keyboard to stop NVDA.

It’s important to note that NVDA provides many other features and keyboard commands that can help you navigate and interact with web pages more efficiently. You can consult the NVDA User Guide or attend an NVDA training course for more information on these advanced features.

TalkBack

TalkBack is a screen reader built into Android operating systems. Here’s how to use TalkBack to read a web page:

  1. Enable TalkBack: Go to Settings > Accessibility > TalkBack and turn on the TalkBack switch.
  2. Launch a web browser: Open Google Chrome or another web browser on your Android device.
  3. Navigate to a web page: Type in the URL of the web page you want to read or use a search engine to find the page.
  4. Start TalkBack: Tap the screen two times quickly to start TalkBack.
  5. Read the page: Swipe left or right with one finger to move from one item to the next on the page. TalkBack will read the text or describe the content as you swipe.
  6. Interact with elements: Double-tap with one finger to activate a link or a button. TalkBack will announce the action and navigate to the next page if you activated a link.
  7. Navigate headings: Swipe up or down with one finger to move through headings, tables, or other structural elements on the page.
  8. Read specific elements: Touch and hold an element with one finger to hear more information about it, such as the URL of a link.
  9. Stop TalkBack: Tap the screen two times quickly to stop TalkBack.

It’s important to note that TalkBack provides many other features and gestures that can help you navigate and interact with web pages more efficiently. You can access the TalkBack User Guide on your Android device or attend a TalkBack training course for more information on these advanced features.

One Comment

The Challenge of Finding a Usable Table on the Web

Several months ago, I started a podcast with a nephew of mine. In a quest to feed what I lovingly call the monster because the podcast always needs content, I was researching some NFL statistics and it was a poignant reminder of what a challenge it is to find consistently usable tables on the web.

 

These NFL tables on individual player statistics for my basic criteria are reasonable. Column and row headers are marked. Column headers are actionable, meaning they are links. Even the sort direction is properly indicated. Great, I’m thinking here could be a handy resource.

 

Reading through the links on the page, I notice a link that says Next Gen Stats. Well, it actually says Next Gen Stats target=1 if you are using a screen reader as a result of an ARIA-Label that needs some obvious attention. Now I’m thinking this could be interesting. These Next Gen Stats are the details about how fast athletes are moving and all the other interesting data being gathered these days. Not so fast though.

 

The first page I go to here has one of the more vexing issues I experience with tables. Column headers are separated here into a separate single row table. The data itself is a second table and as a result, the data is more difficult to review–mind you, not impossible but it requires you first figure out what’s going on and then keep track of the column names as you review information.

 

Last, I opt to review some team statistics. At first glance, these tables seem potentially good, much like the individual player tables. Column and row headers are marked. That said, even though column headers are reachable from the keyboard, they are not created as links, so this is a step backwards in my opinion. On the positive, sort order is indicated.

 

Row headers in this case become a more interesting situation. With NVDA and Narrator, the team name is read as the row header such as, Packers, Vikings, Bears and Lions. JAWS adds a wrinkle because you get text such as logos/GB
Unlabeled graphic – Packers as the row header. While I didn’t fully investigate the source of this difference, my first impression is that JAWS is including part of the image source from the <picture> tag whereas Narrator and NVDA are not. I’d suspect proper alt tags on the images within the picture tag or some other markup would correct things.

 

Recall, I was just trying to feed my podcast monster here and yet this seemingly simple task of looking up a few statistics shows what a mess tables are still for accessibility.

 

Leave a Comment

Right Info Wrong Location: Accessibility Hints

In my web use, I’ve noticed a growing trend to add instructions to more experiences on the web. This week it was more than 250 words on reading a table on a financial web site and another couple hundred words on a calendar control for an airline company.

Most often, this is coming in the form of an ARIA-Label or other hidden text, aimed at explaining how to use the experience with a screen reader in particular. This is typically scoped to some sort of custom control that the site has opted to use.

On the surface, this might sound great. We have some complex experience and want to give users some tips so we’ll jam in a set of instructions. We don’t want to clutter the visual experience with information that isn’t relevant if you are not using a screen reader though, so we’ll hide this info with some web technique we’ve learned about from someplace.

As someone who’s created a number of training videos and other learning content over the years, I recognize the importance and helpfulness a good set of instructions can provide. That said, if you are building an experience where for whatever reason, you find it necessary to add instructions directly into the content, please press the pause button and ask more than once why these instructions are needed.

Often I have found that it is a case that as a “workaround” for not following accessibility guidance instructions are viewed as the solution.

Explaining why you had to break the rules, dressed up as user assistance, does not excuse the rule breaking. In addition, you can write all the instructions you want but that’s no guarantee a user is going to read them.

Even if such instructions are justified, jamming them directly into the middle of the task flow is far from ideal. It seems great on first, second and maybe third use. By about the fifth time you’ve had to use some screen reading command to skip past all these instructions though, you are starting to feel held captive to this inefficient experience.

When talking about accessibility, I will typically share some form of this topic in my presentations. Without fail, audiences will always say that the experience with lengthy instructions seems more accessible when I first compare experiences with and without these sorts of instructions. Then I toss the curve of asking how the same experiences would compare after ten uses of the two. The answers are strikingly different.

This is not to say innovation and creativity are not part of an accessible experience. The exact opposite is true. However, ask yourself if basic tasks such as picking dates, reading a table of information or many other experiences that are common on the web, are the best place to create a control unique to one web site. You are asking users to learn something new specific to your experience and toss out years of learning they’ve likely accumulated when common controls and design patterns are not used.

At minimum, ensure as I say, you press pause and ensure you understand why you are making the choices you are. Put the user front and center and recognize that the user likely wants to get something done, not learn how to use your web site.

If you find you are in a situation where instructions are still warranted, find alternatives to an essay of user education attached to a control. Help links are one great alternative. Find a way, whatever options you choose, to allow a user to stop having to sort through instructions once they are familiar if you absolutely have to use them.

Leave a Comment

Excel Added an Image Function to Add Images From a URL

Keeping up with all the changes in even the Microsoft products I personally use on a regular basis can be a challenge. I say that because I just read about a new function Excel has added to add images to a spreadsheet in a formula and it seems quite handy so far. Full details on the function are in this blog posting.

Screen readers will read the alt text on the image when you navigate to the cell. The Office Accessibility Checker will flag the item as missing alt text if that option is left out of the formula.

The full formula reference is also available

This is a feature in Insider builds of Office 365 at this time. I’m sure the Office team would value feedback if you opt to try this functionality.

Leave a Comment

Outlook Quick Steps Can Save Time and are an accessibility win for me

Over the years I’ve assisted multiple people with Outlook and one feature that I commonly use stands above the rest as far as being a time saver that most people seem to have not discovered. Quick Steps allow you to take multiple actions on an email message with just a couple mouse clicks or a single key press.

Office has a comprehensive support article on this feature.

As an example, most know about filters to sort email into different folders. As handy as this can be, I often find I might forget to check a certain folder or otherwise want to read the email before filing it away. With a Quick Step, I can turn this into a single key press. In my case CTRL+Shift+2 is my hotkey to archive any email message. I have five or six Quick Steps I routinely use and the hotkeys for those have become as automatic as CTRL+r for reply or CTRL+f for forward.

One way to start using Quick Steps is to have focus on a message in an Outlook message list in the Windows Desktop version of Outlook and press Shift+F10 and arrow down to Quick Steps. Outlook comes with some default but most likely you will want to create your own. Choose Manage Quick Steps, followed by tabbing to the New button and finally choosing Custom. From that point, choose the actions you want taken on the message, such as moving to a folder, deleting, replying and more. You can apply multiple actions. There is also an option to assign a hotkey. You are limited to key combinations of CTRL+Shift+1-9 here.

One nice benefit of Quick Steps is that it is an Outlook setting that is saved to the cloud if you are using Outlook in an environment with that option. That means as soon as I add my email account to Outlook on a new computer, my Quick Steps are present too.

You can read the support article for more details or give the feature a try and discover how it can save you time as well.

Leave a Comment

An Update on Using Excel to Track a Fantasy Football Draft

Several years ago I wrote about how I used Excel to track a fantasy football draft. Everything in that post is still relevant but I wanted to provide a short update.

Most importantly, the spreadsheet I use is now available and updated for 2022. See the instructions sheet in the document for how to use it during a fantasy football draft.

The original post talked about my league having 16 players that each fantasy owner drafts. Over the last five years that number has increased to 18.

The short version of how to use this spreadsheet is to:

  1. Update team names on the Teams sheet to reflect players in your league.
  2. During the draft, use the Players sheet to track the player selections by entering the team owner and dollar amount for each player drafted on the Players sheet.
  3. For the various tracking sheets, most notably, the Auction Tracker, use the Data:Refresh Connections command on the Data ribbon or press CTRL+Alt+F5 to have the sheets update with info from what you enter as the draft progresses.
  4. Have fun.

I have made every reasonable effort to verify the accuracy of the formulas and such in this spreadsheet. If you find an issue, please let me know in the comments here or on Twitter, where I am @kellylford.

 

Leave a Comment

Sound From Two Mountain Streams

My wife and I did some hiking in the Rocky Mountains in Colorado this week. Here are two short recordings of a couple mountain streams from our hikes. I always enjoy the sounds of rivers and streams in the mountains and the different sounds they make.

The first was a fairly slow running stream. The second was a bit faster.

I was a bit surprised at the lack of sound in the parts of the mountains we were in. Birds and such were sparse at best and we encountered few streams. The most common sound was the silence of nature. Much as I enjoyed it, it certainly does not make for compelling recording. These two short samples will have to do from this trip.

In these two recordings you hear water running. The first is quieter with a gentle flowing sound. The water is noticeably louder in the second and sounds as though it is running more rapidly.

Leave a Comment

The ADA at 32

A version of this post, with some of the same content, was published last year. A bit of reorganization and some new commentary has been added. I’ve opted to use some of the same content because I do have to ask what has changed in the last year?

If anything, I’d say 31 wasn’t a great year for The Americans with Disabilities Act (ADA) or at the very least, the future is looking less optimistic. The last couple years and months have certainly shown that societal progress is not a certainty and in fact progress can backslide amazingly fast.

July marks the 32nd anniversary of the signing of the ADA. Each year as another year goes by and we celebrate the signing of the legislation, I am torn between feeling celebratory and sad.

I do feel celebratory about the fact that the work of many allowed us to reach this point. As someone who has worked on various accessibility efforts over the years in a small way, I know all too well the toil, endless negotiations and so much more that is required to make progress. So congratulations for sure to those involved in getting this landmark legislation passed and sustained. That is assuredly worth acknowledgement on a grand scale.

Great, celebrate the reality that we have a law that at least gives some hope if it is your choice. I understand perpetual exposure to commentary that it is all trouble can be tough to experience. While you are celebrating, just remember, it isn’t as if in passing the ADA, the amendments to the ADA or other accessibility legislation, Congress has said any of the eight findings they list have gone away.

I urge anyone reading this to read the congressional findings that are listed in the legislation. Like a lot of civil rights legislation, the congressional findings detail that as a class, in this case one to which I belong, people with disabilities are not treated very well and in fact that’s an understatement. Frankly we suffer a staggering amount of outright discrimination and I’m of the opinion that far too often the level of discrimination people with disabilities experience is drastically softened when speaking about the reality of life in the U.S. today. From what I know, the same holds true around the world.

I know from numerous firsthand experiences, calling something discriminatory makes a lot of people uncomfortable. But trust me, experiencing the actual discrimination does much more.

Lest you think discrimination isn’t the right word, here’s the dictionary definition:

“The unjust or prejudicial treatment of different categories of people, especially on the grounds of ethnicity, age, sex, or disability.”

I’m not sure what else you would call much that people with disabilities experience.

To be clear, not all discrimination is intentional in my opinion. In addition, the discrimination is often not the actions of any specific individual but rather the inaction of an organization of some sort, where often well-meaning individuals can find it difficult to go against the inertia that is causing the problematic situations. I also think people get uncomfortable around the term because they think, “Of course I couldn’t discriminate.”

Just because something isn’t intentional does not mean it isn’t happening. Words do matter and how we label what people with disabilities experience daily has an impact on how it is perceived. I recognize on the continuum of circumstances in life, I’m amazingly fortunate. The fact that I can write these words on a computer as someone who’s blind speaks volumes by itself. I have a high degree of security—food, shelter, employment, freedom from war, health insurance and so much more.

If reading that there are multiple challenges is tough or that something seems discriminatory, just remind yourself that the following are still part of the society we’ve created here in the U.S. according to our own congress.

The Congress finds that

(1) physical or mental disabilities in no way diminish a person’s right to fully participate in all aspects of society, yet many people with physical or mental disabilities have been precluded from doing so because of discrimination; others who have a record of a disability or are regarded as having a disability also have been subjected to discrimination;

(2) historically, society has tended to isolate and segregate individuals with disabilities, and, despite some improvements, such forms of discrimination against individuals with disabilities continue to be a serious and pervasive social problem;

(3) discrimination against individuals with disabilities persists in such critical areas as employment, housing, public accommodations, education, transportation, communication, recreation, institutionalization, health services, voting, and access to public services;

(4) unlike individuals who have experienced discrimination on the basis of race, color, sex, national origin, religion, or age, individuals who have experienced discrimination on the basis of disability have often had no legal recourse to redress such discrimination;

(5) individuals with disabilities continually encounter various forms of discrimination, including outright intentional exclusion, the discriminatory effects of architectural, transportation, and communication barriers, overprotective rules and policies, failure to make modifications to existing facilities and practices, exclusionary qualification standards and criteria, segregation, and relegation to lesser services, programs, activities, benefits, jobs, or other opportunities;

(6) census data, national polls, and other studies have documented that people with disabilities, as a group, occupy an inferior status in our society, and are severely disadvantaged socially, vocationally, economically, and educationally;

(7) the Nation’s proper goals regarding individuals with disabilities are to assure equality of opportunity, full participation, independent living, and economic self-sufficiency for such individuals; and

(8) the continuing existence of unfair and unnecessary discrimination and prejudice denies people with disabilities the opportunity to compete on an equal basis and to pursue those opportunities for which our free society is justifiably famous, and costs the United States billions of dollars in unnecessary expenses resulting from dependency and nonproductivity.

Leave a Comment

Fireworks Audio From Tuttle Lake

Warning: The audio linked to inn this post has multiple explosive sounds and they do start at the beginning and some can be loud.

Over the holiday weekend I was at a family cottage at Tuttle Lake in Wisconsin. Each 4th of July weekend, there is a sizable private fireworks display put on by some cottage owners at the lake.

This year, I captured some of the audio and find the sounds of fireworks over open water an interesting sound. I don’t have the audio equipment to capture the full sound field or the way the concussions echo around the lake but this is still an interesting sound. The fireworks are shot from across the lake, pretty much in a direct line of sight to the end of the dock I was sitting on to capture this.

In this audio you can hear the lingering echos of the explosions but live, you can hear them really sort of move around the lake.

With all that’s going on in the world, I do have to wonder too how these same explosions and yells could be the sounds of war and terror. I feel very fortunate to live where I do and not have had to experience the horror of war directly.

I know there are people on all sides of the political spectrum but will say I hope we can all find a way to come together and protect the people of our country, the institutions of the U.S. Government and work to make life better for the people and planet. It is far from perfect and I know firsthand many of the ways we need to improve but this is the world we have and I hope we can all remember we need to share it with each other and our future.

Leave a Comment

Transcribing Audio with Word Online and a Screen Reader

One of the handy but not so obvious features of a Microsoft 365 subscription in my opinion, is the ability to transcribe 300 minutes of audio a month using the online version of Word. I’ve been using this on and off for many months but have started making regular use of it for some personal projects of late. I also noticed a note when uploading audio recently, that the 300 minute restriction appears to be removed. A note in the transcription area said you can now upload unlimited audio. It did also say this may change in the future but this change makes the audio transcription functionality even more compelling to me.

Office has a help article on using this experience. Here are some notes on using this experience with a screen reader in particular that go a bit further than the help article.

Creating a Document

First off, you are going to want to create a new Word document using the online version of Word. I do this by navigating to http://word.office.com and choosing the option to create a new document. Since Word online works best with any screen reader virtual mode off, I typically turn off the JAWS VPC, Narrator Scan Mode or NVDA Browse mode before navigating to the site.

Assuming you are signed into your Microsoft 365 account, focus should land on the Create a New Blank Document link. Simply activate that link and a new document is created and focus is moved to the editing area.

Transcribing Audio

With focus in the document area, you are going to want to access the Dictation option on the Ribbon and specifically the Transcription option. There are different ways to navigate but what I do is:

  1. Press CTRL+Shift+F6 once to access the ribbon. You should be on the Home option. If on a Mac replace CTRL with CMD.
  2. Press Tab once to access the options on the Home ribbon.
  3. To move to different groups of controls here you can use CTRL+Left and Right arrows. Again, replace CTRL with CMD if on a Mac. I have found it faster to use CTRL+Left arrow from here until I reach the group that announces as Voice with focus on a Dictate button.
  4. Press alt+Down Arrow to access the choices here. Replace Alt with Option if on a Mac.
  5. Arrow down to Transcribe and press Enter.
  6. A new pane will open and focus should go to it. If for some reason it does not, again use CTRL+F6 and CTRL+Shift+F6 to locate the Transcribe Pane.
  7. Tab to the Upload audio button and activate it.
  8. You will get the standard File Open dialog for the operating system you are using. Locate the audio file you want to transcribe and upload the audio.
  9. You will get messages about the audio uploading and being transcribed. I have found it fastest to toggle on the virtual mode for a screen reader and read this text. You should find the text a couple lines above a cancel button when reading in this fashion. Remember to turn back off any virtual mode when done reading. Transcription is quite fast in my experience. It is usually done in less than three minutes for both uploading and transcribing an hour’s worth of audio for me.
  10. When the transcription is finished, you will have some new elements in this pane. They will include:
    1. A list of all the recognized speech you can arrow up and down to review/edit. Tab from an entry in this list for an edit button if you want to make an adjustment to the transcription.
    2. Note that if you choose Edit, a new dialog opens with the section of text in an edit box and the speaker identification in a second edit area. You can adjust the speaker name and there is an option to do it for the entire document as well.
    3. After you arrow to any transcript section, there is also a tab stop to that specific segment. Press enter on this to start audio playing at that point.
    4. Controls for adding sections or all the text to the document.
    5. Audio controls for playing the audio you transcribed.
  11. Use the Add to Document button. You will have choices for Text, text, text with speaker identification, text with timestamps or text with both speakers and timestamps. Use up and down arrow to move through choices and Press enter on the item you want.
  12. Focus will likely stay on the Add to Document button. Use CTRL+Shift+F6 to move back to the document area. You will be at the end of any inserted text.

At this point you can make any adjustments to the text you want. For minor edits, I will use either the edit transcription functionality in the Transcribe Pane or Word Online. For extensive edits, I tend to open the document in the full desktop version of Word. For example, a global search and replace can change all references of Speaker 1 to a specific name.

By default the document, is saved in the root of your OneDrive folder and can be edited online or with the desktop version of Word. Speech to text transcription still requires some editing but this is another helpful tool to use from my experience.

3 Comments