Later this year, Apple plans to offer real-time voice transcription and summarization features system-wide across its devices, using artificial intelligence to improve its key applications.
The company is introducing AI-powered summarization and enhanced voice transcription across several of its operating systems.
The new features are expected to make users significantly more efficient in note-taking and voice memo applications, among others.
Apple is currently testing the capabilities as part of updates planned for release with iOS 18 in 2024.
They are also expected to reach the corresponding applications in macOS 15 and iPadOS 18.
Apple’s Voice Memos app, available across its devices, is expected to be one of the first applications to benefit from the new enhancements.
The latest versions of the app provide a running transcript for each audio recording, working in the same manner as the company’s Live Voicemail feature.
The transcript appears in the center of the app window, replacing the visual representation of the recorded audio shown in the current version of the app.
Pre-release versions of both apps include a transcription button shaped like a speech bubble. Clicking the speech bubble displays a transcript of the audio recording within the app.
In the Notes app, voice transcription accompanies a new audio recording option, and the update adds the ability to summarize recorded audio using artificial intelligence, producing a text summary of the key points.
AI-powered summarization, together with the new audio recording and real-time transcription options, is expected to make the Notes app considerably more capable.
Together, these three features should benefit a wide range of practical uses that involve processing large amounts of information and getting quickly to the key points.
Apple has made significant efforts to ensure the accuracy of the voice transcription and summarization features, preserving the original audio alongside the AI-generated text so that no information from the source is lost during transcription or summarization.