Run DeepSeek V3 on Windows: Step by Step Installation Guide

DeepSeek V3 is a cutting-edge AI model designed for advanced reasoning, code generation, and natural-language understanding. Running it on a Windows machine lets users leverage its capabilities locally, gaining privacy, responsiveness, and room for customization.
This guide provides an in-depth walkthrough of how to set up and use DeepSeek V3 on Windows.
What is DeepSeek V3?
DeepSeek V3 is a large language model (LLM) built on a mixture-of-experts (MoE) architecture with 671 billion parameters. It excels in tasks such as:
- Code generation and analysis
- Mathematical problem-solving
- Document summarization
- Content generation
Key Features
- Multi-head latent attention (MLA): Improves inference efficiency by focusing on multiple aspects of input simultaneously.
- Auxiliary-loss-free load balancing: Ensures stable training without auxiliary loss functions.
- Multi-token prediction: Enables faster inference through speculative decoding.
- Processing speed: Handles 60 tokens per second, three times faster than its predecessor, DeepSeek V2.
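The multi-token prediction feature speeds up decoding because the model drafts several tokens ahead and a verification pass keeps only the prefix that checks out. A toy Python sketch of that accept-a-prefix idea (an illustration of speculative decoding in general, not DeepSeek's actual implementation):

```python
def speculative_accept(draft_tokens, verify):
    """Keep the longest prefix of drafted tokens that the verifier agrees with.

    `verify(prefix, token)` stands in for the full model confirming that
    `token` is what it would have produced after `prefix`.
    """
    accepted = []
    for token in draft_tokens:
        if not verify(accepted, token):
            break  # first disagreement: discard this token and the rest
        accepted.append(token)
    return accepted

# Example: the "full model" only agrees with tokens smaller than 3.
print(speculative_accept([1, 2, 3, 4], lambda prefix, tok: tok < 3))  # [1, 2]
```

When most drafted tokens are accepted, several output tokens cost roughly one verification pass, which is where the speedup comes from.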
Why Run DeepSeek Locally on Windows?
Running DeepSeek locally offers several benefits:
- Privacy: Data remains on your machine without external server dependency.
- Speed: No network round-trips or API rate limits; response time depends only on your hardware.
- Customization: Users can tweak settings for specific needs.
- Cost Savings: Avoid expensive cloud computing fees.
System Requirements
Minimum Requirements:
- Operating System: Windows 10 or 11 (64-bit)
- Python Version: Python 3.9 or higher
- GPU: NVIDIA GPU with at least 16GB VRAM (for the 7B model)
- RAM: 32GB
- Storage: 50GB free disk space
Recommended Setup:
- Operating System: Windows 11
- GPU: NVIDIA GPU with 24GB VRAM
- RAM: 64GB
- Storage: 100GB SSD space
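The VRAM and storage figures above follow directly from parameter count times bytes per parameter. A quick back-of-the-envelope helper (weights only; activations and the KV cache need extra headroom, which is why 16GB VRAM is listed for a 7B model):

```python
def model_size_gb(params_billions, bytes_per_param):
    """Approximate size of the model weights alone: 1e9 params * N bytes ~= N GB per billion."""
    return params_billions * bytes_per_param

# The 7B model in FP16/BF16 (2 bytes per parameter) weighs in around 14 GB,
# consistent with the ~14GB download in Step 4.
print(model_size_gb(7, 2))  # 14
print(model_size_gb(7, 1))  # 7  (8-bit quantization roughly halves it)
```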
Step-by-Step Installation Guide
Step 1: Install Required Software
Install Python 3.9:
- Download Python 3.9 from the official website.
- Select the "Windows installer (64-bit)" option.
- During installation:
- Check "Add Python 3.9 to PATH."
- Click "Install Now."
- Verify installation by running `python --version` in Command Prompt.
Install Git:
- Download Git from git-scm.com.
- Follow default installation options.
- Verify installation by running `git --version`.
Install CUDA Toolkit:
- Download CUDA Toolkit version 11.8 from NVIDIA's archive.
- Choose "exe (local)" for installation.
- Verify installation by running `nvcc --version`.
Step 2: Clone the Repository
- Open Command Prompt and run:
git clone https://github.com/khanfar/DeepSeek-Windows.git
cd DeepSeek-Windows
- Create a virtual environment:
python -m venv venv
- Activate the virtual environment:
venv\Scripts\activate
(In PowerShell, run venv\Scripts\Activate.ps1 instead.)
Step 3: Install Dependencies
Install PyTorch:
Run the following command to install PyTorch compatible with CUDA:
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
Install Additional Requirements:
Install other dependencies listed in the repository's requirements.txt
file:
pip install -r requirements.txt
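With the dependencies in place, it is worth confirming that PyTorch actually sees your GPU before downloading ~14GB of weights. A small check (a hypothetical helper, not part of the repository) that degrades gracefully if PyTorch is missing:

```python
def cuda_status():
    """Report whether PyTorch can use the GPU, without crashing if it is absent."""
    try:
        import torch
    except ImportError:
        return "PyTorch is not installed - rerun the pip install step"
    if torch.cuda.is_available():
        return f"CUDA OK: {torch.cuda.get_device_name(0)} (torch {torch.__version__})"
    return "PyTorch installed but CUDA unavailable - check drivers and the cu118 wheel"

print(cuda_status())
```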
Step 4: Download and Configure the Model
Download Model Weights:
Run the download script provided in the repository:
python download_model.py
This will download the 7B parameter model (~14GB).
Convert Model Format (Optional):
If required, convert the model weights for optimized performance:
python fp8_cast_bf16.py --input-fp8-hf-path model_weights --output-bf16-hf-path model_weights_bf16
Step 5: Start the Server
Run the server to activate DeepSeek V3 locally:
python windows_server.py --model model_weights_bf16 --trust-remote-code
The server will start at http://127.0.0.1:30000.
Step 6: Test the Model
Use the provided test client script to verify functionality:
python test_client.py
Alternatively, interact with DeepSeek via its web interface or API.
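You can also query the local server directly over HTTP. The sketch below builds an OpenAI-style chat request using only the standard library; the /v1/chat/completions path and payload shape are assumptions based on common OpenAI-compatible servers, so adjust them to whatever windows_server.py actually exposes:

```python
import json
import urllib.request

BASE_URL = "http://127.0.0.1:30000"  # address printed when the server starts

def build_chat_request(prompt, model="model_weights_bf16"):
    """Build (but do not send) an OpenAI-style chat completion request.

    The endpoint path is an assumption; check the repository's server docs.
    """
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/v1/chat/completions",
        data=body,
        headers={"Content-Type": "application/json"},
    )

# To actually send it (requires the server from Step 5 to be running):
# with urllib.request.urlopen(build_chat_request("Hello!")) as resp:
#     print(json.load(resp)["choices"][0]["message"]["content"])
```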
Using DeepSeek V3 Coder
DeepSeek V3 Coder specializes in programming assistance, code debugging, and software development tasks.
Access Methods:
- Web Browser Interface:
- Visit www.deepseekv3.com and use its chat interface for queries.
- API Integration:
- Register for an API key on DeepSeek's platform.
- Use OpenAI-compatible SDKs to interact with the model programmatically.
Example API call using Python:
from openai import OpenAI

client = OpenAI(api_key="", base_url="https://api.deepseek.com")

response = client.chat.completions.create(
    model="deepseek-chat",
    messages=[
        {"role": "system", "content": "You are a helpful assistant"},
        {"role": "user", "content": "Generate Python code for a calculator"},
    ],
    stream=False,
)

print(response.choices[0].message.content)
Troubleshooting Common Issues
Issue: Model Not Loading Properly
- Ensure GPU drivers are up-to-date.
- Verify CUDA Toolkit compatibility with your GPU.
Issue: Slow Performance
- Upgrade RAM or GPU specifications for better processing speeds.
Issue: Dependency Errors During Installation
- Double-check your Python version and installed packages using `pip list`.
Conclusion
Running DeepSeek V3 on Windows unlocks powerful AI capabilities locally, enhancing productivity and privacy while reducing costs associated with cloud computing.
By following this guide, users can successfully set up and leverage DeepSeek’s advanced features for coding, mathematical reasoning, and more.