Research Review on Practical Course Design for Localization Deployment of Open-source Large Language Models — Practical Path of Industry-Education Collaboration Based on Ollama/vLLM Toolchain

Genyuan Wang; Wenshuang Li

doi:10.18063/EIR.v3i11.1702

ARTICLE

Volume 4,Issue 3

All Issues

Submit to EIR

Cite this article

Citations

Views

More by Authors Links

20 December 2025

Research Review on Practical Course Design for Localization Deployment of Open-source Large Language Models — Practical Path of Industry-Education Collaboration Based on Ollama/vLLM Toolchain

Genyuan Wang¹, Wenshuang Li¹

Show Less

¹ Hainan Vocational University of Science and Technology, Haikou 571126, Hainan, China

EIR 2025 , 3(11), 11–20; https://doi.org/10.18063/EIR.v3i11.1702

© 2025 by the Author. Licensee Whioce Publishing, Singapore. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution 4.0 International License ( https://creativecommons.org/licenses/by/4.0/ )

Abstract

Addressing the widespread reliance on cloud-based Application Programming Interfaces (APIs) in AI courses at vocational colleges and students’ limited practical experience in localized model deployment, this study systematically reviews the research status and implementation approaches of localized deployment of open-source large language models (LLMs) in vocational AI training programs. Through bibliometric analysis, case study analysis, and theoretical framework construction, we synthesize domestic and international research findings since 2020, focusing on the technological evolution of open-source LLMs, existing challenges in vocational AI education, and critical elements of localized course design. The study reveals a significant structural imbalance in vocational AI education characterized by “technological maturity versus pedagogical lag”: while open-source models like LLaMA3 and Qwen2.5 can efficiently run on consumer-grade GPUs, and toolchains such as Ollama and vLLM have substantially reduced deployment barriers, over 90% of institutions still operate AI courses at the cloud API invocation level. Building on this, we propose a localized course design framework encompassing three dimensions: technology stack selection, curriculum module design, and industry-academia collaboration mechanisms. The Ollama platform integrated with the vLLM toolchain employs Low-Rank Adaptation (LoRA) lightweight fine-tuning technology, representing the most technically viable and pedagogically applicable localized AI training solution for vocational colleges. The school-enterprise “dual-track” curriculum collaboration mechanism forms the institutional foundation for ensuring high-quality implementation of such courses. The proposed “theory-instrument-practice” three-stage framework provides replicable reference models for peer institutions across the province.

Keywords

Open-source large language model

Localized deployment

Vocational AI education

Industry-education collaborative training

LoRA fine-tuning

Ollama

Funding

Ministry of Education Industry-Academia Cooperation Collaborative Talent Cultivation Project: “Training Course Design for Localized Deployment of Open-Source AI Models” (Project No.: 250700409014913)

References

[1] Wang H, Lu W, Wang C, 2025, Design and Implementation of Distributed Local Large Model Services Based on OpenWebUI and LiteLLM. Yangtze River Information and Communications, 38(12): 20–24.

[2] Deng Y, 2025, Research on Information Security Risk Control Strategies for Localized Deployment of Large Models in Archival Management. Heilongjiang Archives, 2025(5): 141–143.

[3] Jin H, Cui C, 2025, Research on Customized AI Large Models Based on Localization and Privatized Databases. Architectural Technology, 56(20): 2480–2482.

[4] Song H, 2025, IT Operations Knowledge Base Based on Localized Deployment of Artificial Intelligence Models. Computer Programming Techniques and Maintenance, 2025(9): 122–124 + 132.

[5] Li R, Yang J, 2025, Research on Localization Deployment and Security Protection System Based on DeepSeek Large Model. Wireless Interconnection Technology, 22(17): 1–6.

[6] Tao X, 2024, Research on Large Language Model-Based Intelligent Q&A System Based on Hybrid Architecture. Post and Telecommunications Design Technology, 2024(5): 48–55.

Previous article in this issue

Next article in this issue