Automated medical scribe

From Wikipedia, the free encyclopedia

Automated medical scribes (also called artificial intelligence scribes, AI scribes, digital scribes, virtual scribes, ambient AI scribes, AI documentation assistants, and digital/virtual/smart clinical assistants[1]) are tools for transcribing medical speech, such as patient consultations and dictated medical notes. Many also produce summaries of consultations. Automated medical scribes based on large language models (LLMs, commonly called "AI", short for "artificial intelligence") grew sharply in popularity in 2024. There are privacy and antitrust concerns. Accuracy is also a concern, and concerns intensify when tools go beyond transcribing and summarizing and are asked to organize information by its meaning, since LLMs do not handle meaning well (see weak artificial intelligence). Medics using these scribes are generally expected to understand the ethical and legal considerations and to supervise the outputs.

The privacy protections of automated medical scribes vary widely. While it is possible to do all the transcription and summarizing locally, with no connection to the internet, most closed-source providers require that data be sent to their own servers over the internet, processed there, and the results sent back (as with digital voice assistants). Some say[according to whom?] they use zero-knowledge encryption (meaning that the service provider can't access the data). Others explicitly say[according to whom?] that they use patient data to train their AIs, or rent or resell it to third parties; the nature of privacy protections used in such situations is unclear, and they are likely not to be fully effective.[citation needed]
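
As an illustration of the underlying principle only (a minimal sketch, not any vendor's implementation), data can be encrypted on the clinician's device with a key the service provider never holds; the placeholder audio bytes and the use of Python's third-party cryptography library are assumptions made for the example:

    # Minimal sketch: client-side ("zero-knowledge"-style) encryption, where
    # the key never leaves the clinician's device. Requires the third-party
    # "cryptography" package; the audio bytes are a placeholder.
    from cryptography.fernet import Fernet

    key = Fernet.generate_key()        # generated and kept locally
    cipher = Fernet(key)

    audio = b"...recorded consultation audio..."   # hypothetical recording
    token = cipher.encrypt(audio)                  # safe to transmit or store

    assert cipher.decrypt(token) == audio          # only the key holder can read it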

Most providers have not published any safety or utility data in academic journals,[1] and are not responsive to requests from medical researchers studying their products.[2]

Privacy

Some providers are unclear about what happens to user data.[3] Some may sell data to third parties.[1] Some explicitly send user data to for-profit tech companies for secondary purposes,[1] which may not be specified. Some require users to sign consents to such reuse of their data.[4] Some ingest user data to train the software,[1] promising to anonymize it; however, deanonymization may be possible (that is, it may become obvious who the patient is).[3]

It is intrinsically impossible to prevent an LLM from correlating its inputs; LLMs work by finding similar patterns across very large data sets. Some information on the patient will be known from other sources (for instance, information that they were injured in an incident on a certain day might be available from the news media; information that they attended specific appointment locations at specific times is probably available to their cellphone provider, apps, and data brokers; information about when they had a baby is probably implied by their online shopping records; and they might mention lifestyle changes both to their doctor and on a forum or blog). The software may correlate such information with the "anonymized" clinical consultation record and, asked about the named patient, provide information which the patient only told their doctor privately. Because a patient's record is all about the same patient, it is all unavoidably linked; in very many cases, medical histories are intrinsically identifiable.[5] Depending on how common a condition is and what other data are available, k-anonymity may be useless. Differential privacy could theoretically preserve privacy.
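
A minimal sketch of how differential privacy works in principle (not drawn from the cited sources): calibrated random noise is added to an aggregate statistic so that no single patient's record can substantially change the published value. The function name and the numbers below are illustrative:

    # Laplace mechanism: add noise with scale sensitivity/epsilon to a count.
    # Smaller epsilon means more noise and stronger privacy.
    import math
    import random

    def dp_count(true_count: int, epsilon: float, sensitivity: float = 1.0) -> float:
        u = random.random() - 0.5      # uniform on [-0.5, 0.5)
        noise = -(sensitivity / epsilon) * math.copysign(1.0, u) * math.log(1.0 - 2.0 * abs(u))
        return true_count + noise

    # e.g. publishing how many patients in a cohort have a given condition:
    print(dp_count(42, epsilon=0.5))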

Data broker companies like Google, Amazon, and Microsoft have produced or bought up medical scribes,[2] some of which use user data for secondary purposes,[4] which has led to antitrust concerns.[6] Transfer of patient records for AI training has, in the past, prompted legal action.[7]

Open-source programs typically do all the transcription locally, on the doctor's own computer.[8][9] Open-source software is widely used in healthcare, with some national public healthcare bodies holding hack days.[10]
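
For illustration, a fully local transcription pipeline can be assembled from open-source components. The sketch below assumes the open-source Whisper speech-recognition package and its model weights are already installed on the machine (after which no internet connection is needed); the file name is hypothetical:

    # Local transcription with the open-source Whisper model; nothing is
    # sent over the network once the model weights are installed.
    import whisper

    model = whisper.load_model("base")             # runs on the local machine
    result = model.transcribe("consultation.wav")  # hypothetical file name
    print(result["text"])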

Multifactor authentication for access to the data is expected practice.[1]
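
As a sketch of one common second factor, time-based one-time passwords (TOTP) pair a shared secret with the current time; the example below assumes the third-party pyotp library:

    # TOTP second factor: the server and the user's authenticator app share
    # a secret; a six-digit code derived from it and the clock is verified.
    import pyotp

    secret = pyotp.random_base32()   # enrolled once in the authenticator app
    totp = pyotp.TOTP(secret)

    code = totp.now()                # the code the app would currently display
    print(totp.verify(code))         # True within the 30-second window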

Platforms

Scribes may operate on desktop, laptop, or mobile computers, under a variety of operating systems. These vary in their risks; for instance, mobile devices can be lost.[11][12][13][14] The underlying mobile or desktop operating systems are also part of the trusted computing base; if they are not secure, the software relying on them cannot be secure either.[15][16]

Confabulation, omissions, and other errors

Like other LLMs, medical-scribe LLMs are prone to confabulation, making up content based on statistical associations between their training data and the transcription audio.[3] LLMs do not distinguish between transcribing the audio and guessing which words will come next; the two processes are mixed together.[17] They are especially likely to invent speech when transcribing short silences or non-speech noises.[17][4]

LLM medical scribes have been known to confabulate racist and otherwise prejudiced content, partly because the training datasets of many LLMs contain pseudoscientific texts about medical racism. They may misgender patients.[3] A survey found that most doctors preferred, in principle, that scribes be trained on data reviewed by medical subject experts.[18] Relevant, accurate training data increases the probability of an accurate transcription, but does not guarantee accuracy.[17] Software trained on thousands of real clinical conversations produced transcripts with lower word error rates, and software trained on manually transcribed data did better than software trained on automatically transcribed data[2] (such as YouTube captions).
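
Word error rate is the usual metric for such comparisons: the number of word substitutions, deletions, and insertions needed to turn the generated transcript into the reference transcript, divided by the reference length. A minimal sketch (the example sentences are invented):

    # Word error rate via edit distance between reference and hypothesis.
    def wer(reference: str, hypothesis: str) -> float:
        ref, hyp = reference.split(), hypothesis.split()
        d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
        for i in range(len(ref) + 1):
            d[i][0] = i
        for j in range(len(hyp) + 1):
            d[0][j] = j
        for i in range(1, len(ref) + 1):
            for j in range(1, len(hyp) + 1):
                cost = 0 if ref[i - 1] == hyp[j - 1] else 1
                d[i][j] = min(d[i - 1][j] + 1,         # deletion
                              d[i][j - 1] + 1,         # insertion
                              d[i - 1][j - 1] + cost)  # substitution
        return d[len(ref)][len(hyp)] / len(ref)

    print(wer("patient reports chest pain", "patient chest pains"))  # 0.5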

Automated scribes omit parts of the conversation classified as irrelevant, and may wrongly classify pertinent information as irrelevant and omit it. They may also confuse historic and current symptoms, or otherwise misclassify information. They may also simply mistranscribe the speech, writing something incorrect instead. If clinicians do not carefully check the output against the recording, such mistakes can make their way into medical records and cause patient harm.[1]

Consent and legal requirements

Professional organizations generally require that scribes be used only with patient consent; some bodies may require written consent. Medics must also abide by local surveillance laws, which may criminalize recording private conversations without consent.[1] Full information on how data is encrypted, transmitted, stored, and destroyed should be provided. In some jurisdictions, it is illegal to transmit the data to any country without equivalent privacy laws, or to process or store the data there; products from vendors who cannot guarantee that data will not illegally be sent abroad cannot legally be used.[1]

Some vendors collect data for reuse or resale. Medical professionals are generally considered to have a duty to review the terms and conditions of the user agreement and identify such data reuse.[1] General practices are generally required to provide information on secondary uses to patients, allow them to opt out of secondary uses, and obtain consent for each specific secondary use. Data must only be used for agreed-upon purposes.[1][19]

Technology and market

The medical scribe market is, as of 2024, highly competitive, with over 50 products on the market. Many of these products are just proprietary wrappers around the same LLM backends,[6] including backends whose designers have warned they are not to be used for critical applications like medicine.[20] Some vendors market scribes specialized to specific branches of medicine (though most target general practitioners, who make up about a third of doctors[where?]). Increasingly, vendors market their products as more than scribes, claiming that they are intelligent assistants and co-pilots to doctors.[6] These broader uses raise more accuracy concerns.[17][4] Extracting information from the conversation to autopopulate a form, for instance, may be problematic, with symptoms incorrectly auto-labelled as "absent" even if they were repeatedly discussed. Models failed to extract many indirect descriptions of symptoms, like a patient saying they could only sleep for four hours (instead of using the word "insomnia").[2]
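
A toy sketch of why such extraction misses indirect descriptions: a simple keyword rule finds the word "insomnia" but not a paraphrase of the same symptom. The keyword list and sentences are invented for the example:

    # Naive keyword matching finds explicit symptom names but misses paraphrases.
    SYMPTOM_KEYWORDS = {"insomnia", "headache", "nausea"}

    def naive_extract(utterance: str) -> set:
        words = {w.strip(".,").lower() for w in utterance.split()}
        return SYMPTOM_KEYWORDS & words

    print(naive_extract("My insomnia is getting worse."))           # {'insomnia'}
    print(naive_extract("I only sleep about four hours a night."))  # set() - missed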

LLMs are not trained to produce facts, but things which look like facts. The use of templates and rules can make them more reliable at extracting semantic information,[20] but "confabulations" or "hallucinations" (convincing but wrong output) are an intrinsic part of the technology.
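
A minimal sketch of the template-and-rules idea (not any vendor's implementation): structured output is checked against a fixed schema, and anything that does not fit is flagged for human review. The field names and the example record are assumptions:

    # Validate a scribe's structured output against a fixed schema; reject
    # anything outside the allowed vocabulary rather than trusting free text.
    ALLOWED_STATUS = {"present", "absent", "not discussed"}
    REQUIRED_FIELDS = {"symptom", "status"}

    def validate(record: dict) -> bool:
        return REQUIRED_FIELDS <= record.keys() and record["status"] in ALLOWED_STATUS

    llm_output = {"symptom": "insomnia", "status": "probably"}  # invented example
    if not validate(llm_output):
        print("Flag for clinician review:", llm_output)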

Pricing

With the exception of fully open-source programs, which are free, medical scribe computer programs are rented rather than sold ("software as a service"). Monthly fees vary from mid-two figures to four figures, in US dollars. Some companies run on a freemium model, where a certain number of transcriptions per month are free.[21][22]

Scribes that integrate into Electronic Health Records, removing the need for copy-pasting, typically cost more.[23][better source needed]

Fully open-source scribes provide the software for free. The user can install it on hardware of their choice, or pay to have it installed. Some open-source scribes can be installed on the local device (that is, the one recording the audio) or on a local server (for instance, one serving a single clinic). They can typically be set not to send any information externally, and can indeed be used with no internet connection.

References

  1. ^ a b c d e f g h i j k "RACGP - Artificial intelligence (AI) scribes". www.racgp.org.au. Retrieved 13 December 2024.
  2. ^ a b c d van Buchem, Marieke M.; Boosman, Hileen; Bauer, Martijn P.; Kant, Ilse M. J.; Cammel, Simone A.; Steyerberg, Ewout W. (26 March 2021). "The digital scribe in clinical practice: a scoping review and research agenda". npj Digital Medicine. 4 (1): 1–8. doi:10.1038/s41746-021-00432-5. ISSN 2398-6352. Retrieved 13 December 2024.
  3. ^ a b c d Kuzub, Alena. "How will AI scribes affect the quality of health care?". Northeastern Global News. Retrieved 13 December 2024.
  4. ^ a b c d "Researchers say an AI-powered transcription tool used in hospitals invents things no one ever said". AP News. 26 October 2024. Retrieved 13 December 2024.
  5. ^ "NHS plans leave 'anonymous' medical data vulnerable". New Scientist. Retrieved 13 December 2024.
  6. ^ a b c Dorn, Spencer. "Where AI Ambient Scribes Are Heading". Forbes. Retrieved 13 December 2024.
  7. ^ Stanton, Rich (22 October 2021). "Google AI department sued for using the health data of 1.6 million NHS patients | PC Gamer". PC Gamer. Archived from the original on 22 October 2021. Retrieved 13 December 2024.
  8. ^ "ClinicianFOCUS/FreeScribe". ClinicianFOCUS. 12 December 2024. Retrieved 13 December 2024.
  9. ^ Schmidt, Rebecca (12 February 2024). "Automatische Transkriptionssoftware – ein Erfahrungsbericht". Sozialwissenschaftliche Methodenberatung (in German). doi:10.58079/vsz4. Retrieved 13 December 2024.
  10. ^ Karopka, T.; Schmuhl, H.; Demski, H. (2014). "Free/Libre Open Source Software in Health Care: A Review". Healthcare Informatics Research. 20 (1): 11–22. doi:10.4258/hir.2014.20.1.11. PMC 3950260. PMID 24627814.
  11. ^ Perera, Chandrashan (1 June 2012). "Principles of Security for the use of Mobile Technology in Medicine". Journal of Mobile Technology in Medicine: 5–7. doi:10.7309/jmtm.10.
  12. ^ "How Can You Protect and Secure Health Information When Using a Mobile Device? | HealthIT.gov". www.healthit.gov. Retrieved 13 December 2024.
  13. ^ Martínez-Pérez, Borja; de la Torre-Díez, Isabel; López-Coronado, Miguel (January 2015). "Privacy and Security in Mobile Health Apps: A Review and Recommendations". Journal of Medical Systems. 39 (1). doi:10.1007/s10916-014-0181-3.
  14. ^ Xing, Yawen; Lu, Huizhe; Zhao, Lifei; Cao, Shihua (9 September 2024). "Privacy and Security Issues in Mobile Medical Information Systems (MMIS)". Mobile Networks and Applications. doi:10.1007/s11036-024-02299-8.
  15. ^ The Nizza Secure-System Architecture (PDF) (Report). 2005. Retrieved 13 December 2024.
  16. ^ Weiss, Luca. "Porting mainline Linux to mobile phones". archive.fosdem.org. Retrieved 13 December 2024.
  17. ^ a b c d Edwards, Benj (28 October 2024). "Hospitals adopt error-prone AI transcription tools despite warnings". Ars Technica. Retrieved 13 December 2024.
  18. ^ Landi, Heather (11 October 2024). "Abridge integrates UpToDate decision support into AI scribe". www.fiercehealthcare.com. Retrieved 13 December 2024.
  19. ^ "RACGP - Three key principles for the secondary use of general practice data by third parties". www.racgp.org.au. Retrieved 13 December 2024.
  20. ^ a b Edwards, Benj (18 April 2023). "GPT-4 will hunt for trends in medical records thanks to Microsoft and Epic". Ars Technica. Retrieved 13 December 2024.
  21. ^ Brodwin, Erin (21 March 2024). "Comparing AI medical scribes by price and features". Axios. Retrieved 13 December 2024.
  22. ^ "AI scribe wars heating up - Medical Republic". 2 November 2024. Archived from the original on 2 November 2024. Retrieved 13 December 2024.
  23. ^ "AI medical scribes: Boosting efficiency or risking over-reliance?". KevinMD.com. 29 October 2024. Retrieved 13 December 2024.