For people with disabilities, as well as just for sybarites, OS developers have created voice control computer. It allows the user to enter information by voice. After pronouncing certain words, the device starts speech recognition - converting the audio signal into digital information. After the entered information is correctly recognized - the program proceeds to the specified action algorithm - it performs the function that is attached to a particular command.

Everything is quite simple. Speech is not always recognized correctly, so the computer voice control program is not intensively used to solve complex tasks of managing the operating system. It is used to perform basic functions: opening and closing files, local and network searches, etc.

The history of the development of voice control

  • The first Audrey voice recognition system was created in the 1950s. She deciphered only the numbers spoken in one voice.
  • In 1962, the first word recognition system was created. She transcribed 15 English words.
  • With the development of computers in 1990, the Dragon Dictate program was developed. She recognized up to 100 words per minute, but had a high price.
  • In the early 2000s, the speech recognition app Google Voice Search appeared on the iPhone. In 2010, a search engine was added to Android.
  • Siri has been included in software Phone 4S in early October 2011;
  • In 2014, Cortana, a voice assistant for Windows, was introduced.

Cortana and voice input capabilities to date

Cortana is a virtual assistant in the Windows operating system. The service helps the user in planning things, reminds them of them.
At a certain request, the service will help to collect specific information, create a clear structure and present it to the user in the most processed form possible.
It is interesting that immediately upon turning on the virtual assistant collects all the information about the entered requests, personal data, trying to adapt as much as possible to each individual user.


Voice control of a Windows 7 computer through the use of a virtual assistant is not possible - it is integrated only into the tenth version. But, sadly, the developers did not bother to release the Russian-language version.
The main role is played by the search, which in 10-ke can be opened through the "Start". This function defines almost any queries. If the entry is not recognized, you can enter the appropriate command in the pop-up window and the computer voice control program reads the text information.

An unpleasant moment is the collection of all data entered through the keyboard and sending them to Microsoft.

Third Party Programs

Type

After completing the installation, proceed to the next step - create account. Here you need to come up with a key phrase, after which an activation notification will sound.


Next, you will need to come up with and create voice commands, regardless of their purpose. The dog command can launch an application or do something else entirely.




You just need to create a voice command and assign it to a specific action. Suitable only for performing basic operations - opening files, folders, etc. The functionality is limited.

speaker

Here the functionality is wider than in Type.


Voice control of a Windows 10 computer provides the user with the ability to open and close files, take screenshots of the screen, turn off the PC.


Speech recognition takes a long time, over 3-4 seconds. This is due to the fact that speech is first converted into text, and commands are recognized by the computer already from textual information.

Laitis

it free program, which allows you to both control your PC and dictate text. After installation, you need to register and then you can use it for your pleasure.


An interesting autocorrect function when typing. You can say "quotation marks" and the corresponding character will appear in the text.

Possibilities of voice control through Yandex.string

Through the use of this application, you can perform local or network searches for information and files, restart or shut down your computer. There is a function to open programs and sites.
To use the program, you must first download and install it.

But during installation, it is worth unchecking the boxes opposite the items where the software manufacturer offers to install a browser, change its settings. Otherwise, the installation will take longer and the configuration will change in the browser.
Ultimately, the line is placed near the "Start" button. Say "Listen to Yandex" and a window will open.

Speak the request.

After a pause, a search bar will open in the browser. It's a good idea to manage your search this way.
In general, computer voice control has not yet been developed, as it is drawn to us in the imagination. But even those features that are available today are already impressive and significantly help to move to a new level of PC use.

Have a great day!

You can interact with a computer not only using the keyboard and mouse. Voice command control is also available. There are special utilities that allow you to do this. Their functions include not only recording text from dictation or transcribing audio recordings. Through them, you can run applications, use them, and in general - do anything. Controlling your computer with your voice makes it easier to work with a PC. Commands are transmitted promptly and effortlessly. Of course, if you have a microphone.

We will introduce you to applications with similar functionality.

This feature is built into the English Windows version. You must have an Enterprise or Ultimate license to use it. But also in Russified operating systems You can set up voice control and text dictation. Use one of the following apps.

The app is easy to understand

Popular program. Although it has its drawbacks. The essence of its work is simple: you set a command and choose what action it should perform. Consider setting up this application using a specific example.

  1. Download and install it. There is a free and premium version. The second one must be bought to try it on a computer.
  2. Run the utility. An information window with hints will appear in it.
  3. Its control panel has many different functions. Some of them have the same name. It is necessary to navigate by the picture, not by the inscription. Press the "Add" button - it shows a face.
  4. Specify a profile name and a keyword to identify the command. For example, write "open" if you are going to set up the launch of some application by voice. Or "go to" to instantly go to a site without entering its address.
  5. Now we need to record this very command in the form of a sound image. Click on the button with the red circle. And clearly, clearly pronounce the desired phrase into the microphone.
  6. Confirm changes. The specified option will appear in the list in the Type window. The program will remember what you have recorded on its "voice recorder".
  7. After that, specify what, in fact, to run it to execute the command. Click the "Add" button, which looks like a hand with a "+" (plus) symbol.
  8. Select the data format: files/utilities, web pages, some internal OS services. Put the checkboxes where you want.
  9. Find the application you want to launch with your voice. Let it be, for example, Microsoft Word. So you can very quickly start editing some text or writing an article.
  10. In the same window, write down the second part of the command. So that in total it turns out to "open the Word". The first word will enable Type, the second will enable the associated utility.
  11. Click Add.
  12. You can attach several applications to one “open” function. This way you will control their launch without touching the mouse and other peripherals installed on the computer.
  13. If necessary, edit additional parameters.
  14. To check if it worked or not, click on "Start talking" and say the command.

The program works with the Russian language. But it doesn't always recognize it correctly. It is necessary to speak loudly, clearly, in a mechanical voice.

  • Knowledge of English is not required.
  • Quick command creation.
  • No text recognition.
  • Limited functionality. You can only open utilities and pages on the Internet.
  • The program sometimes perceives extraneous noise as commands. Because of this, strange things happen on the PC.
  • You can not work with the player.

Speechka

Another application for computer management

  1. At the first start, a window will open with a choice of category: PC or Internet.
  2. There is also an explanation of what keyboard shortcut to activate the utility. This can be changed in the settings.
  3. Click on "Internet" for example. A window will open with several input fields: for the command text and for the site URL. You can write the word "Yandex" and the address of this page.
  4. Click Add.
  5. Hold down the keys indicated on the main window.
  6. Say the command so that the utility “remembers” it.
  • Activation by both keys and sound volume.
  • At startup, you can calibrate the microphone.
  • Limited functionality.

speaker

The interface is designed in a minimalist way

Commands in the application are configured using printed words, not dictation. There is an internal text recognition mechanism. Main functions:

  • Create screenshots on command.
  • Change the keyboard layout on your computer.
  • Opening applications and files.
  • Completion of work.
  • You can not make an audio recording with the team. The utility recognizes labels.
  • You need to use the keyboard to control it. If the specified button is used for other purposes, it will be inconvenient.
  • You need a stable internet connection.

Gorynych

The hero of Russian folk tales will help you

The program for controlling a computer with the voice of "Gorynych" is a domestic product. Therefore, there is a "native" speech recognition module. It "adapts" to the timbre and intonations of the user. With the utility, you can fully work in the system, and not just open files and web pages.

  • There is support for Russian and English languages.
  • Text recognising, voice input to any editor.
  • Extended functionality.
  • It is necessary to independently create commands for each process. Literally, you have to write down a dictionary.

Windows Speech Recognition

A program built into the English OS. To use it, you must have the appropriate language pack installed. Russian teams will not work with her. To control a PC with it, you will have to speak its language. To access it in the Panel Windows settings open the " Regional and Language Standards" menu (it is located in the "Hours, language, region" category) and set "English" in all tabs. If everything is correct, and you have the necessary language pack installed, Windows will “turn” into English, and the utility will become available. It is better not to try this method if you do not know a foreign language well.

This method is suitable if you speak English

Other utilities

There are a number of applications for managing such commands:

  • Browser extensions. Facilitate web surfing. AT Google Chrome a similar function is already built in - voice input in search forms. This option is available on some online maps. It allows you to quickly find the address.
  • voicetype.
  • RealSpeaker.
  • Web Speech.

List of text recognition and dictation software

Voice control is, of course, good. But utilities for OCR and typing from dictation can be useful. When compiling voluminous reports, diplomas, it is easier to write down your thoughts by voicing them into a microphone on a computer. Here are some of these utilities:

  • Dictograph.
  • Dragon Naturally Speaking.
  • Perpetuum Mobile.

A product that allows you to dictate text to a computer

You can set up voice commands in the OS. To do this, the appropriate program must be installed on the computer. With it, you can work on a PC, lying on the couch or lounging in an armchair. You will have free hands. If the microphone is good and picks up even distant or quiet sounds, you won't need to sit next to it. You can simultaneously "talk" with the computer and write notes in a notebook, draw, hold something. Yes, even sew and knit. With commands, interacting with the PC is much easier. To activate some of these utilities, you need to press buttons on the keyboard, which is not very convenient.

But there are also negative aspects. If you accidentally say a command word, an application that is completely unnecessary right now will open or the browser will go to some site in the wrong place. What to use and whether to use at all - it's up to you.

Today we will talk about our speech. Would you like control computer by voice, without the help of fingers? And, as they say, by the power of thought! True, we will not control the computer with the power of thought, but with the power of the voice it is quite real.

Type program- This is one of the best software for controlling a computer through voice. On sites in the comments to this program, opinions converge.

True, it has its shortcomings. But more on that later. By the way, if you are interested, read my review.

You can download the program here: http://freesoft.ru/type

How to use it? First, let's run it and see the main control buttons:

The program welcomes us and immediately gives us hints on how to use Type. At the beginning, we will press the “add” button and write down the word, for example, “open”. To do this, say this word into the microphone:

Then click add. So, we saved the word “Open” in the program with our voice. You can speak any other words into the microphone. The main thing is not to get confused.

The next step is to add commands. To do this, go to this point:

Then we check the box next to the item that we need:

Select a program, application or action and click on the red record button. If the computer accepted our voice, click "Add":

And now one voice command will be visible in our profile. In this case, the one that opens 7-Zip:

And now, by pressing the final button "start talking"

we say the phrase "open Seven Zip". In my case, everything will work. And the 7-zip program will open. Remember this phrase: Sim sim open? Here is something about the same.

The program does not always work properly. Now the mighty Russian language has not been fully studied by linguist programmers ... But still, it's nice when a computer obeys you.

Therefore, for testing and banal curiosity, the Typle program is 100% suitable.

In this video you can see the history of the creation of the first voice engines and what else we need to work on:

There are such terrible names of other analogues of the program as Gorynych, Perpetuum, Dictograph, Voice Commander. But all of them are “not that one”. Do not pass the criticism of a worthy program.

It took me 5 minutes to master this program. This is quite a long time (mostly, I understand such programs in 1-2 minutes). If you have any questions - write. See you soon, friends :)!

In another attempt to implement ideas from science fiction films, one by one, tech giants began to work on virtual assistants. At the Google I/O 2016 conference, the company introduced Google Assistant, as well as an analogue of Amazon Echo - Google Home voice assistant.

The last major company to enter the virtual assistant race. Let's see what competitors it has (including among startups).

10 Virtual Assistants: An Overview

Alexey Zenkov

First, let's remember what Google Now is.

Google/Google Now voice search

Peculiarities: Quick. Extremely accurate when creating routes. It frightens with its awareness of your flights, bookings and other details. with some third party applications: Manage notes, messages and music playback.

Flaws: Sometimes it bothers you with excessive initiative (for example, it shows the results of the games of teams that you are not interested in, or routes home from famous places). Useless when managing a "smart home". Work on integration with third-party applications seems to have stalled.

Humanity level: Null. Not conducive to communication. It doesn't even have a name other than Google.

Summary: Vast reserves of personal data and access to a search engine should, in theory, make Google an industry leader, but the company hasn't even been able to figure out how to use its advantages and create an assistant that can understand the user. Today Google Now and voice search they compete on equal terms with Siri, but have not yet reached a new level of development.

And now - about competitors.

Apple Siri

What: A voice assistant that can talk to the user and give proactive recommendations. Activated by long pressing the Home button on and iPad. Assistant support has recently appeared on Apple TV and Apple Watch.

Peculiarities: Easy to use on iOS devices. Understands natural speech. Well informed about news, weather, sports, movies, routes and local businesses. Can tell you what to watch on TV. Able to interact with some elements of the "smart home".

Flaws: Cannot interact with most other applications and services. Works slower than some competitors.

Humanity level: Not able to maintain a full conversation, but at certain points demonstrates his own wisdom. The female voice sounds relatively human.

Flaws: Feels great on Windows - the platform that developers, and perhaps users, are least interested in. Using the assistant on Android and iOS is more difficult, and there are fewer functions.

Humanity level: He loves jokes, especially banal ones. Has a long list of witty answers to common questions at the ready. Can read excerpts from Shakespeare.

Summary: After years of being in the shadow of Siri and Google, Cortana has become a much more interesting chatbot. Microsoft wants to make their own voice assistant basic intelligence for all other bots that can manage your travels, appointments, to-do lists and other things, as well as increase the degree of integration with other Microsoft products, such as Office. The company is aiming to create a new shell for post-PC computing, but it's too early to tell if it will succeed or not.

Facebook M

What: Partly driven, partly human, and still in development. M will be a text-based assistant in the Facebook Messenger environment.

Peculiarities: Will try to do whatever you ask.

Flaws: It is not yet a finished product, and will not be for a long time. Available only to a small number of users in San Francisco.

Humanity level: Extremely high, since people will participate in the formation of answers to questions. According to Wired, the company hopes that over time, M will learn from these operators and be able to work more independently.

Summary: At the moment, M is just a little more than just an idea. But given Facebook's interest in chatbots in general, it wouldn't be surprising if M ended up becoming super-intelligent.

X.ai

What: One of the few virtual assistants with only one function. Works only through e-mail, where he can make appointments at your request.

Peculiarities: Knows your schedule and preferences, negotiates with other participants for you.

Peculiarities: Viv promises that their product will be able to understand complex questions, such as: “Will the temperature near the Golden Gate Bridge exceed 20 degrees the day after tomorrow after 5 p.m.?”. Work is underway to ensure compatibility with third-party applications.

Flaws: So far, apart from prepared presentations, there is no evidence that everything works exactly as stated.

Humanity level: Values ​​visual aids and concrete answers more than detailed description. Wit is questionable.

Flaws: Possibilities for integration with third-party applications are limited, and it is impossible to open the service directly on iOS or Android. Requests that the assistant cannot recognize are redirected to Ask.com.

Humanity level: Not conducive to long conversations, but knows how to answer additional questions.

Summary: It seems that mobile applications Hound actually exists only to show the capabilities of the Houndify service, which SoundHound plans to sell to other companies. If everything works out, we will not even know that we are using it.

Ozlo

What: AI, the main function of which at the moment is the search for cafes, bars and restaurants. Available for a limited number of users.

Peculiarities: Finds and combines data from several sources, including Yelp and Foursquare, and then presents everything in the form of convenient cards. Tries to communicate by asking and answering follow-up questions, such as "what places are open right now?" or "what's on their menu?"

Flaws: Limited features, unless the creators of Ozlo add new features. When learning, AI is highly dependent on users.

Humanity level: Avoids unnecessary courtesies, only briefly greetings by name.

Summary: Ozlo would be no different from a lot of other chatbots if it didn't have the prospect of building something bigger. The ability to combine data from multiple sources in a single output is unique, but it is not yet clear whether the developers will be able to realize the full potential they claim. As long as Ozlo's business plan is limited to just the app, it can be a challenge to collect the data needed for training.

SpeakToIt Assistant.ai

What: One of the many copies of Siri. In the app store, searching for Siri brings up many similar programs, such as Voice Commands, Voice Secretary, and Assistant.

Peculiarities: Not unlike Siri, but can learn user commands to activate a list of features.

Flaws: Not as useful as the built-in assistant in your smartphone, and not as convenient.

Humanity level: Sounds rather unnatural, but portrays himself as a human assistant whose gender and appearance can be changed.

Summary: Some of these Siri clones look like a relic of the past, when not all iPhone models could work with Apple's proprietary assistant and needed to be replaced. In any case, it seems that their creators are aware that such an approach will not allow them to succeed. For example, SpeakToIt moved on to creating a set of tools that other developers could use to build their own chatbots.