Volume 4,Issue 3
Research Review on Practical Course Design for Localization Deployment of Open-source Large Language Models — Practical Path of Industry-Education Collaboration Based on Ollama/vLLM Toolchain
Addressing the widespread reliance on cloud-based Application Programming Interfaces (APIs) in AI courses at vocational colleges and students’ limited practical experience in localized model deployment, this study systematically reviews the research status and implementation approaches of localized deployment of open-source large language models (LLMs) in vocational AI training programs. Through bibliometric analysis, case study analysis, and theoretical framework construction, we synthesize domestic and international research findings since 2020, focusing on the technological evolution of open-source LLMs, existing challenges in vocational AI education, and critical elements of localized course design. The study reveals a significant structural imbalance in vocational AI education characterized by “technological maturity versus pedagogical lag”: while open-source models like LLaMA3 and Qwen2.5 can efficiently run on consumer-grade GPUs, and toolchains such as Ollama and vLLM have substantially reduced deployment barriers, over 90% of institutions still operate AI courses at the cloud API invocation level. Building on this, we propose a localized course design framework encompassing three dimensions: technology stack selection, curriculum module design, and industry-academia collaboration mechanisms. The Ollama platform integrated with the vLLM toolchain employs Low-Rank Adaptation (LoRA) lightweight fine-tuning technology, representing the most technically viable and pedagogically applicable localized AI training solution for vocational colleges. The school-enterprise “dual-track” curriculum collaboration mechanism forms the institutional foundation for ensuring high-quality implementation of such courses. The proposed “theory-instrument-practice” three-stage framework provides replicable reference models for peer institutions across the province.
[1] Wang H, Lu W, Wang C, 2025, Design and Implementation of Distributed Local Large Model Services Based on OpenWebUI and LiteLLM. Yangtze River Information and Communications, 38(12): 20–24.
[2] Deng Y, 2025, Research on Information Security Risk Control Strategies for Localized Deployment of Large Models in Archival Management. Heilongjiang Archives, 2025(5): 141–143.
[3] Jin H, Cui C, 2025, Research on Customized AI Large Models Based on Localization and Privatized Databases. Architectural Technology, 56(20): 2480–2482.
[4] Song H, 2025, IT Operations Knowledge Base Based on Localized Deployment of Artificial Intelligence Models. Computer Programming Techniques and Maintenance, 2025(9): 122–124 + 132.
[5] Li R, Yang J, 2025, Research on Localization Deployment and Security Protection System Based on DeepSeek Large Model. Wireless Interconnection Technology, 22(17): 1–6.
[6] Tao X, 2024, Research on Large Language Model-Based Intelligent Q&A System Based on Hybrid Architecture. Post and Telecommunications Design Technology, 2024(5): 48–55.