- 1
ChatGPT is launching voice and image capabilities in the latest upgrades. - 2
Plus, Enterprise users will be able to access voice and image features in ChatGPT over the next 2 weeks.
ChatGPT is beginning with the new addition of voice and snap editions which will benefit the user in unique ways. It is a more intuitive way to get the result by offering the conversation or showing ChatGPT what the user wants to know about.
Voice is coming out in iOS and Android while image is available on all platforms.
The user can use the voice to engage in alternative conversations with assistance. The user can request a story or solution to any problem. For this, go to the settings option, click on the new feature on the mobile app and opt for voice conversation. Click on the headphone button on the right corner of the home screen. New voice capability is powered by the next text-to-speech model which helps to generate human-like voices.
Details of ChatGPT New Features
They have collaborated with voice artists. The preferred voice choices are 5 from which one has to be selected. The audio is created from the text. For transcribing spoken words to texts ‘Whisper’ is used.
For images, tap the image button to capture or choose the image. If using the smartphone, whether iOS or Android, choose the + button. Discuss the multiple images or use the drawing tool. Image understanding is powered by GPT3.5 and GPT-4. A wide range of images like photographs, screenshots, and documents containing text and images are read.
Vision-based models may raise new challenges so to avoid them before deployment they tested the model with read teamers for risks like extremism and scientific proficiency. This research has enabled them to put on a few details for responsible usage.
Like other ChatGPT features, the vision is to assist users with daily life. Be For My Eyes, the mobile app for blinds and low vision people helps to understand the uses and limitations. Users told them that they find it valuable to have conversations about images. They input this feature in ChatGPT and significantly limit the ChatGPT feature to analyze and make direct statements. By doing this, ChatGPT respects the individual’s privacy.
ChatGPT provides transparency about the limitations. Users might depend on specialized topics like research. The model is proficient with the English language but with other languages, the results are not well-versed.
The voice facility is the new voice chat created with voice actors. This new voice translation technology is capable of producing realistic synthetic voices from just a few seconds of real speech.
Summary
Plus & Enterprise users will get the voice and images facility in the next two weeks which helps converse and get information related to the images. The main focus of ChatGPT is for users to provide more information in any easy manner of voice and details through snaps.
Source: https://www.thecoinrepublic.com/2023/10/01/chatgpt-enables-voice-image-features-for-plus-enterprise-users/