Project push
This commit is contained in:
5
.gitignore
vendored
Normal file
5
.gitignore
vendored
Normal file
@@ -0,0 +1,5 @@
|
||||
# Virtual Env
|
||||
.venv
|
||||
|
||||
# Local data
|
||||
.DS_Store
|
||||
78
README.md
Normal file
78
README.md
Normal file
@@ -0,0 +1,78 @@
|
||||
# Audio Summary with local LLM
|
||||
|
||||
This tool is designed to provide a quick and concise summary of audio and video files. It supports summarizing content either from a local file or directly from YouTube. The tool uses Whisper for transcription and a local version of Mistral AI (Ollama) for generating summaries.
|
||||
|
||||
> [!TIP]
|
||||
> It is possible to change the model you wish to use.
|
||||
> To do this, change the `OLLAMA_MODEL` variable, and download the associated model via [ollama](https://github.com/ollama/ollama)
|
||||
|
||||
## Features
|
||||
|
||||
- **YouTube Integration**: Download and summarize content directly from YouTube.
|
||||
- **Local File Support**: Summarize audio files available on your local disk.
|
||||
- **Transcription**: Converts audio content to text using Whisper.
|
||||
- **Summarization**: Generates a concise summary using Mistral AI (Ollama).
|
||||
|
||||
## Prerequisites
|
||||
|
||||
Before you start using this tool, you need to install the following dependencies:
|
||||
|
||||
- Python 3.8 or higher
|
||||
- `pytube` for downloading videos from YouTube.
|
||||
- `pathlib` for local file handling.
|
||||
- `openai-whisper` for audio transcription.
|
||||
- [Ollama](https://ollama.com) for LLM model management.
|
||||
- `ffmpeg` (required for whisper)
|
||||
|
||||
## Installation
|
||||
|
||||
### Python Requirements
|
||||
|
||||
Clone the repository and install the required Python packages:
|
||||
|
||||
```bash
|
||||
git clone https://github.com/damienarnodo/audio-summary-with-local-LLM.git
|
||||
cd audio-summary-with-local-LLM
|
||||
pip install -r src/requirements.txt
|
||||
```
|
||||
|
||||
### LLM Requirement
|
||||
|
||||
[Download and install](https://ollama.com) Ollama to manage the local LLM models.
|
||||
More details about the supported LLM models can be found on the Ollama [github](https://github.com/ollama/ollama).
|
||||
|
||||
Download and use the Mistral model:
|
||||
|
||||
```bash
|
||||
ollama pull mistral
|
||||
|
||||
## Test the access :
|
||||
ollama run mistral "tell me a joke"
|
||||
```
|
||||
|
||||
## Usage
|
||||
|
||||
The tool can be executed with the following command line options:
|
||||
|
||||
- `--from-youtube`: To download and summarize a video from YouTube.
|
||||
- `--from-local`: To load and summarize an audio or video file from the local disk.
|
||||
|
||||
### Examples
|
||||
|
||||
1. **Summarizing a YouTube video:**
|
||||
|
||||
```bash
|
||||
python src/summary.py --from-youtube <YouTube-Video-URL>
|
||||
```
|
||||
|
||||
2. **Summarizing a local audio file:**
|
||||
|
||||
```bash
|
||||
python src/summary.py --from-local <path-to-audio-file>
|
||||
```
|
||||
|
||||
The output summary will be saved in a markdown file in the specified output directory.
|
||||
|
||||
## Output
|
||||
|
||||
The summarized content is saved as a markdown file named `summary.md` in the current working directory. This file includes the transcribed text and its corresponding summary.
|
||||
3
src/requirements.txt
Normal file
3
src/requirements.txt
Normal file
@@ -0,0 +1,3 @@
|
||||
openai-whisper==20231117
|
||||
pytube==15.0.0
|
||||
ollama==0.1.8
|
||||
98
src/summary.py
Normal file
98
src/summary.py
Normal file
@@ -0,0 +1,98 @@
|
||||
import whisper
|
||||
import ollama
|
||||
import argparse
|
||||
from pytube import YouTube
|
||||
from pathlib import Path
|
||||
|
||||
|
||||
WHISPER_MODEL = "base"
|
||||
OLLAMA_MODEL = "mistral"
|
||||
|
||||
def download_from_youtube(url: str, path: str):
    """Download the audio track of a YouTube video as an mp4 file.

    The audio-only mp4 stream is selected (not a video stream) and saved
    as ``to_transcribe.mp4`` inside *path*.

    Args:
        url: Full URL of the YouTube video.
        path: Directory where the downloaded file is written.
    """
    video = YouTube(url)
    # Pick the first audio-only mp4 stream offered for this video.
    audio_stream = video.streams.filter(file_extension="mp4", only_audio=True).first()
    audio_stream.download(Path(path), filename="to_transcribe.mp4")
|
||||
|
||||
# Function to transcribe an audio file using the Whisper model
def transcribe_file(file_path: str, output_file: str) -> str:
    """Transcribe an audio/video file with Whisper and save the text.

    Args:
        file_path: Path of the audio or video file to transcribe.
        output_file: Path of the text file the transcription is written to.

    Returns:
        The transcribed text.
    """
    # Load the Whisper model (model size controlled by WHISPER_MODEL)
    model = whisper.load_model(WHISPER_MODEL)
    # Transcribe the audio file
    transcribe = model.transcribe(file_path)
    # Make sure the destination directory exists before writing
    # (the caller may point at a "tmp/" dir that was never created).
    Path(output_file).parent.mkdir(parents=True, exist_ok=True)
    # Write as UTF-8 explicitly: the transcript may contain non-ASCII
    # characters and the platform default encoding (e.g. cp1252) would fail.
    with open(output_file, 'w', encoding='utf-8') as tmp_file:
        tmp_file.write(transcribe["text"])
    print(f"Transcription saved to file: {output_file}")
    # Return the transcribed text
    return transcribe["text"]
|
||||
|
||||
# Function to summarize a text using the Ollama model
def summarize_text(text: str, output_path: str) -> str:
    """Generate a concise summary of *text* with the local Ollama model.

    Args:
        text: The transcript to summarize.
        output_path: Unused; kept for backward compatibility with existing
            callers. The caller is responsible for writing the summary.

    Returns:
        The summary produced by the model.
    """
    # Define the system prompt for the Ollama model
    # (plain string: there is nothing to interpolate here)
    system_prompt = "I would like for you to assume the role of a Technical Expert"
    # Define the user prompt for the Ollama model
    user_prompt = f"""Generate a concise summary of the text below.
Text : {text}
Add a title to the summary.
Make sure your summary has useful and true information about the main points of the topic.
Begin with a short introduction explaining the topic. If you can, use bullet points to list important details,
and finish your summary with a concluding sentence."""

    # Use the Ollama model to generate a summary
    response = ollama.chat(
        model=OLLAMA_MODEL,
        messages=[
            {
                "role": "system",
                "content": system_prompt,
            },
            {
                "role": "user",
                "content": user_prompt,
            },
        ],
    )
    # Return the generated summary text only
    return response["message"]["content"]
|
||||
|
||||
def main():
    """Entry point: parse CLI args, transcribe the input, write a summary.

    Exactly one of ``--from-youtube`` / ``--from-local`` is required; the
    markdown summary is written to ``--output`` (default ``./summary.md``).
    """
    # Parse command line arguments
    parser = argparse.ArgumentParser(description="Download, transcribe, and summarize audio or video files.")
    group = parser.add_mutually_exclusive_group(required=True)
    group.add_argument("--from-youtube", type=str, help="YouTube URL to download.")
    group.add_argument("--from-local", type=str, help="Path to the local audio file.")
    parser.add_argument("--output", type=str, default="./summary.md", help="Output markdown file path.")

    args = parser.parse_args()

    # Set up data directory; create it up front so downloads and the
    # transcript file can be written even on a fresh checkout.
    data_directory = Path("tmp")
    data_directory.mkdir(parents=True, exist_ok=True)

    if args.from_youtube:
        # Download from YouTube
        print(f"Downloading YouTube video from {args.from_youtube}")
        download_from_youtube(args.from_youtube, str(data_directory))
        file_path = data_directory / "to_transcribe.mp4"
    elif args.from_local:
        # Use local file
        file_path = args.from_local

    print(f"Transcribing file: {file_path}")
    # Transcribe the audio file
    transcript = transcribe_file(str(file_path), str(data_directory / "transcript.txt"))

    print("Generating summary...")
    # Generate summary
    summary = summarize_text(transcript, "./")

    # Write summary to a markdown file
    with open(args.output, "w", encoding="utf-8") as md_file:
        md_file.write("# Summary\n\n")
        md_file.write(summary)
    print(f"Summary written to {args.output}")


if __name__ == "__main__":
    main()
|
||||
Reference in New Issue
Block a user