llama-3.3-70b

Model Description

Meta Llama 3.3 is a state-of-the-art 70-billion-parameter multilingual large language model (LLM) for text generation. An instruction-tuned variant of the Llama architecture, it is optimized for assistant-like dialogue in English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai. The model uses an optimized transformer architecture with Grouped-Query Attention (GQA) for efficient inference and was trained on over 15 trillion tokens of publicly available data, with a knowledge cutoff of December 2023. It combines supervised fine-tuning (SFT) with reinforcement learning from human feedback (RLHF) to align responses with human preferences for helpfulness and safety. Notable features include a 128k-token context window and tool-calling support. The model is distributed under Meta's custom commercial license (the Llama 3.3 Community License), which prohibits unlawful uses and deployment in unsupported languages without appropriate safety measures, and it demonstrates strong performance on industry benchmarks.
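The tool-calling support mentioned above is typically exercised through an OpenAI-compatible chat-completions payload. The sketch below builds such a request body; the `get_weather` function and its schema are illustrative assumptions, not part of this model card.

```python
import json

# Hypothetical tool definition; the name and schema are illustrative assumptions.
weather_tool = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Return the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}

# OpenAI-style chat-completions request body targeting llama-3.3-70b.
request_body = {
    "model": "llama-3.3-70b",
    "messages": [{"role": "user", "content": "What's the weather in Lisbon?"}],
    "tools": [weather_tool],
}

print(json.dumps(request_body, indent=2))
```

If the model decides to call the tool, the response will contain a `tool_calls` entry with JSON arguments matching the declared schema, which your code executes before sending the result back in a follow-up message.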

🔔 How to Use

```mermaid
graph LR
    A("Purchase Now") --> B["Start Chat on Homepage"]
    A --> D["Read API Documentation"]
    B --> C["Register / Login"]
    C --> E["Enter Key"]
    D --> F["Enter Endpoint & Key"]
    E --> G("Start Using")
    F --> G
    style A fill:#f9f9f9,stroke:#333,stroke-width:1px
    style B fill:#f9f9f9,stroke:#333,stroke-width:1px
    style C fill:#f9f9f9,stroke:#333,stroke-width:1px
    style D fill:#f9f9f9,stroke:#333,stroke-width:1px
    style E fill:#f9f9f9,stroke:#333,stroke-width:1px
    style F fill:#f9f9f9,stroke:#333,stroke-width:1px
    style G fill:#f9f9f9,stroke:#333,stroke-width:1px
```
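Once you have an endpoint and key from the steps above, a minimal API call can be assembled as below. The base URL and key are placeholders, and an OpenAI-compatible `/chat/completions` route is assumed; substitute the real values from your account and the API documentation.

```python
import json
import urllib.request

# Placeholder values; replace with the endpoint and key from your account.
API_ENDPOINT = "https://api.example.com/v1/chat/completions"  # assumed OpenAI-compatible route
API_KEY = "YOUR_API_KEY"

body = json.dumps({
    "model": "llama-3.3-70b",
    "messages": [{"role": "user", "content": "Hello!"}],
}).encode("utf-8")

req = urllib.request.Request(
    API_ENDPOINT,
    data=body,
    headers={
        "Content-Type": "application/json",
        "Authorization": f"Bearer {API_KEY}",
    },
    method="POST",
)

# Send once real credentials are in place:
# with urllib.request.urlopen(req) as resp:
#     print(json.load(resp)["choices"][0]["message"]["content"])
```

The request is only constructed here, not sent, so the sketch runs without valid credentials.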

Recommended Models

o3-pro

The o-series of models are trained with reinforcement learning to think before they answer and perform complex reasoning. The o3-pro model uses more compute to think harder and provide consistently better answers. o3-pro is available in the Responses API only to enable support for multi-turn model interactions before responding to API requests, and other advanced API features in the future. Since o3-pro is designed to tackle tough problems, some requests may take several minutes to finish. To avoid timeouts, try using background mode.

o3

Our most powerful reasoning model with leading performance on coding, math, science, and vision

gemini-2.5-flash-preview-04-17

Gemini-2.5-Flash-Preview-04-17 is a multimodal large language model that accepts text, image, video, and audio inputs, offering code execution capabilities and high token limits.