AI Medical Transcription App
Overview
This Medical Transcription App is a web-based tool that uses speech recognition and AI to transcribe and process medical information. It’s designed to assist medical professionals in quickly documenting patient information, including findings, symptoms, potential diagnoses, plans, and treatments.

Features
- Real-time speech-to-text transcription
- AI-powered processing of medical information
- Categorization of medical data into structured formats
- Support for multiple AI providers (OpenAI, Anthropic, Groq)
- User-friendly interface with visual feedback
Prerequisites
- A modern web browser (Chrome or Edge recommended for best speech recognition support)
- An API key from one of the supported AI providers (OpenAI, Anthropic, or Groq)
Installation
- Clone this repository:
git clone https://github.com/jmesplana/ai_medical_transciber.git
- Navigate to the project directory:
- Open
index.html in your web browser.
Usage
- Open the app in your web browser.
- Enter your API key in the provided field.
- Click the microphone icon to start recording.
- Speak clearly, providing medical information about a patient.
- The app will transcribe your speech in real-time.
- Once you stop speaking, the AI will process the transcription and categorize the information.
- Review the processed information in the “Processed Data” section.
Important Notes
- Security: This app processes the API key client-side. This is not secure for production use and is intended for testing and demonstration purposes only.
- Privacy: Be aware that transcriptions are sent to external AI services for processing. Ensure you have appropriate consent and follow relevant privacy regulations when using real patient data.
- Costs: Using the AI services may incur costs depending on your API plan. Monitor your usage to manage expenses.
Customization
You can customize the AI provider by modifying the apiUrl in the processTranscription function. The default is set to OpenAI’s API.
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
License
This project is licensed under the MIT License - see the LICENSE.md file for details.
Acknowledgments
- Web Speech API for speech recognition capabilities
- OpenAI, Anthropic, and Groq for AI processing capabilities
If you have any questions or feedback, please open an issue in this repository.