Skip to main content

Glossary

The glossary provides you with core terms covering AI technologies and related fields, serving as a foundational reference for understanding product features and technical principles.

Term CategoryTerm Name/AbbreviationDefinition
Model & Algorithm TechnologyLoRA(Low-Rank Adaptation)A lightweight model fine-tuning technique that reduces fine-tuning parameters through low-rank matrix factorization, applicable to text-to-image, large language models, and other scenarios.
LLM(Large Language Model)Large-scale language models trained on massive text data, supporting tasks such as text generation and question answering, e.g., Qwen2.5, GLM-4.
SFT(Supervised Fine-Tuning)Optimizing pre-trained models using labeled data to improve performance on specific tasks, suitable for small dataset scenarios.
DPO(Direct Preference Optimization)Optimizing models based on human preference data to enhance the conversational quality of large language models.
TransformersAn architecture based on self-attention mechanisms, used for NLP tasks and one of the core architectures of LLMs.
DiffusersA generative image model library supporting text-to-image and image-to-image generation, such as Stable Diffusion.
Function Calling/Tool CallingThe capability of large models to invoke preset functions/tools, enabling cross-tool collaboration (e.g., calling a weather API).
Data & Training RelatedDatasetA structured collection of data used for model training, divided into public and private, supporting multiple formats.
Dataset CardA file (README.md) that records dataset metadata (license, language, etc.) to help understand the dataset.
Model CardA document describing model information, including use cases and limitations, hosted as the README.md in the model repository.
TokenThe basic unit for models to process text, used for calculating length and billing.
API & Service TechnologyServerless APIAPI services that eliminate the need for server management, billed by the number of calls, supporting functions such as text generation.
API WorkflowA visual workflow engine that connects multiple model APIs to automate tasks (e.g., "text→image" generation).
Access TokenIdentity credentials for API calls, divided into full-resource and partial authorization, with permission controlled by binding resource packages.
Model Context Protocol (MCP)A protocol that defines how models request tools to perform operations, interact with the environment, and obtain results from external systems.
Temporary TokenShort-term valid tokens, suitable for client-side calls with higher security.
Failover MechanismAutomatic switching to other compute resources when the primary resource fails, enabled via the request header X-Failover-Enabled.
Image and Multimedia TechnologyText-to-ImageGenerate images based on text, such as the FLUX.1 model, which can control effects through parameters.
Image-to-ImageGenerate new images based on the original image, supporting style transfer and local repainting.
OCR (Optical Character Recognition)Technology for extracting text from images, supporting multi-language, table, and formula recognition.
TTS (Text-to-Speech)Converting text to speech with voiceprint customization, such as the ChatTTS model.
ASR (Automatic Speech Recognition)Converting speech to text, supporting multiple languages and dialects.
Deployment & Computing RelatedModel EngineA managed model deployment service that supports custom computing power, suitable for production environments.
Edge DeviceEdge devices (e.g., mobile phones) that support the deployment of lightweight models for low-latency inference.
GPUParallel computing hardware suitable for model training and inference, such as NVIDIA A10.
vllmA high-performance LLM inference library that optimizes the concurrency and latency of conversational models.
DockerA containerization tool that packages applications and their environments, supporting custom configurations.
Parameter & Configuration RelatedTemperatureControls generation randomness (0-1); lower values result in stable output, while higher values lead to more creative results.
Top_pControls diversity (0-1), used in conjunction with temperature.
Guidance ScaleA parameter in image generation that controls the influence of text prompts.
JSON SchemaA specification that constrains the output format of models to ensure structured data generation.