IONOS AI Model Hub

Your gateway to a secure multimodal AI platform

  • One platform for the most powerful AI models
  • Fair and transparent token-based pricing
  • No vendor lock-in with open source

Unleash the full power of AI securely and efficiently

An AI platform that gives you access to leading open-source generative models for text and image generation.

Top-level data security

Committed to reliability

  • Safeguard your business data within the secure IONOS Cloud
  • No third-party access to your personal data
  • Reliable hosting in IONOS data centers

User-friendly REST APIs

Easy integration

  • Full flexibility for integration into your applications
  • High compatibility with standard solutions
  • Straightforward handling via interfaces

Open-source technology

No vendor lock-in

  • Choose from high-performance large language & text-to-image models
  • Access to the latest and most reliable open-source models
  • No licensing fees for using deployed models

Personalized results

Utilize your own data

  • Connect large language models directly to your own data to improve their answers
  • Refine results with retrieval-augmented generation (RAG)

Intelligent vector search

Precisely identify patterns

  • Identify semantically similar terms, texts, or entire contexts
  • Advanced data analysis and personalized search for AI and ML
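
As a rough illustration of how semantic similarity search works in general, the sketch below ranks a few items by cosine similarity to a query vector. The three-dimensional vectors and the function names are made up for this example; in practice the vectors come from an embedding model, and nothing here reflects the actual IONOS AI Model Hub API.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # a zero vector has no direction to compare
    return dot / (norm_a * norm_b)


def rank_by_similarity(query_vec, items):
    """Sort (label, score) pairs from most to least similar to the query."""
    scored = [(label, cosine_similarity(query_vec, vec)) for label, vec in items.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical toy embeddings; real embeddings have hundreds of dimensions.
toy_embeddings = {
    "invoice": [0.9, 0.1, 0.0],
    "bill": [0.8, 0.2, 0.1],
    "holiday": [0.0, 0.2, 0.9],
}

ranking = rank_by_similarity([1.0, 0.0, 0.0], toy_embeddings)
for label, score in ranking:
    print(f"{label}: {score:.3f}")
```

Terms with related meanings ("invoice" and "bill" in this toy data) end up close together, which is why they rank above an unrelated term.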

Secure vector database

Use specialized AI databases

  • Store and index various data types as vectors
  • Fully managed solution for high performance and scalability

IONOS stands for secure and sustainable cloud solutions

Step by step to your LLM from the IONOS Cloud

  1. Query: Your query is sent to our powerful vector database to identify relevant information.

  2. Retrieve document: The vector database accesses the data and retrieves the most relevant documents for your query.

  3. LLM prompt: The retrieved documents are passed to your chosen large language model (LLM) to create a tailored response.

  4. Response: The response generated by the LLM is returned and ready for immediate use.
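
The four steps above can be sketched in a few lines of Python. This is a minimal, self-contained illustration and not the IONOS AI Model Hub API: a word-count similarity stands in for the vector database, `fake_llm` is a placeholder for a real model call, and all function names are hypothetical.

```python
import math
from collections import Counter

# Toy document collection standing in for data stored in a vector database.
DOCUMENTS = [
    "ChromaDB storage is billed per 30 days.",
    "Llama 3.1 8B Instruct belongs to the Standard model category.",
    "FLUX.1 [schnell] is a text-to-image model.",
]


def embed(text):
    """Toy embedding: a bag of lowercase words (real systems use an embedding model)."""
    return Counter(text.lower().split())


def similarity(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[term] * b[term] for term in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(query, documents, top_k=1):
    """Steps 1 and 2: compare the query with every document and keep the best matches."""
    query_vec = embed(query)
    ranked = sorted(documents, key=lambda doc: similarity(query_vec, embed(doc)), reverse=True)
    return ranked[:top_k]


def build_prompt(query, context_docs):
    """Step 3: combine the retrieved documents and the question into one prompt."""
    context = "\n".join(f"- {doc}" for doc in context_docs)
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


def fake_llm(prompt):
    """Placeholder for the call to a real large language model."""
    return f"[model response to a prompt of {len(prompt)} characters]"


def answer(query):
    """Step 4: return the generated response."""
    return fake_llm(build_prompt(query, retrieve(query, DOCUMENTS)))


print(answer("Which category is Llama 3.1 8B Instruct in?"))
```

In a real deployment, `retrieve` would query the managed vector database and `fake_llm` would be replaced by a request to the chosen model.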

Fair pricing based on usage

Benefit from a pay-per-use pricing model based on tokens.

Large language models (free until March 31, 2025)

Prices in parentheses apply from April 1, 2025.

Model category   Models                                              Price per 1 million input tokens   Price per 1 million output tokens
Standard         Llama 3.1 8B Instruct, Mistral 7B Instruct          $0 ($0.23)                         $0 ($0.39)
Plus             Code Llama 13b Instruct HF, Mixtral 8x7B Instruct   $0 ($0.69)                         $0 ($1.00)
Premium          Llama 3.1 70B Instruct, Llama 3.1 405B Instruct     $0 ($2.31)                         $0 ($2.70)

A token typically represents a unit of text processed by an AI model during inference. It can be a word, character, or another unit, depending on the model and language.
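
To get a feel for what the listed rates mean in practice, the sketch below estimates the cost of a workload from its input and output token counts. The rates are the per-million-token prices listed above for the period from April 1, 2025; the function name and the example workload are assumptions made for illustration, and actual billing may differ.

```python
from decimal import Decimal

# (input rate, output rate) in USD per 1 million tokens, from April 1, 2025.
RATES_PER_MILLION = {
    "Standard": (Decimal("0.23"), Decimal("0.39")),
    "Plus": (Decimal("0.69"), Decimal("1.00")),
    "Premium": (Decimal("2.31"), Decimal("2.70")),
}

ONE_MILLION = Decimal(1_000_000)


def estimate_cost(category, input_tokens, output_tokens):
    """Estimate the USD cost of a workload in the given model category."""
    input_rate, output_rate = RATES_PER_MILLION[category]
    input_cost = Decimal(input_tokens) / ONE_MILLION * input_rate
    output_cost = Decimal(output_tokens) / ONE_MILLION * output_rate
    return input_cost + output_cost


# Hypothetical workload: 2 million input tokens and 500,000 output tokens.
for category in RATES_PER_MILLION:
    print(category, estimate_cost(category, 2_000_000, 500_000))
```

`Decimal` is used instead of floating-point numbers so that small per-token amounts add up without rounding drift.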

Text-to-image (free until March 31, 2025)

Models                                        Price per image   Price per image from April 1, 2025
Stable Diffusion XL, FLUX.1 [schnell] (new)   $0                $0.032

Data collections (free until March 31, 2025)

Embedding model                         Price per 1 million tokens   Price per 1 million tokens from April 1, 2025
paraphrase-multilingual-mpnet-base-v2   $0                           $0.03
bge-large-en-v1.5                       $0                           $0.03
bge-m3                                  $0                           $0.123

Vector database   Price per 1 million tokens   Price per 1 million tokens from April 1, 2025
ChromaDB query    $0                           $0.03

Data collections storage

Vector database    Price per 1 million tokens
ChromaDB storage   $0.0154 / 30 days

Terms and conditions - Free of charge until March 31, 2025:

  • This campaign is valid from December 1, 2024, to March 31, 2025, and is available to both new and existing customers of the IONOS AI Model Hub during the campaign period.

  • Customers receive a 100% discount on usage of the AI Model Hub, except for ChromaDB storage, which is excluded from this promotion and charged at the normal rate.

  • IONOS reserves the right to impose reasonable limits on usage to maintain the performance and integrity of the platform, and it may adjust these limits or restrict the inflow of new customers at its discretion to ensure optimal service quality.

  • After the campaign period ends on March 31, 2025, all pricing will revert to the standard rates as published on the IONOS website.

  • This offer is non-transferable and cannot be combined with any other promotions or discounts. IONOS also reserves the right to modify or cancel this promotion at any time without prior notice. By participating in this campaign, customers agree to comply with all IONOS policies and guidelines.

The best AI for your applications in the IONOS Cloud

You're always a step ahead with the IONOS AI Model Hub.

Use case

Conduct comprehensive market analysis

Use our LLMs to analyze and summarize your documents effectively.

  • Use information from external sources like websites
  • Link your LLM with ChromaDB
  • Develop your business based on new insights

Use case

Create knowledge databases for patents

Build an application that provides patents or other confidential information within your company.

  • Import and store patents as plain text in ChromaDB
  • Find relevant material with a search string
  • Keep track of your data, always

Use case

Automate customer service with AI

Use your company's internal knowledge base for chatbot-supported customer inquiries.

  • Export articles as text and import them into ChromaDB
  • Enhance customer engagement with quick, personalized responses
  • Let the IONOS AI Model Hub's LLM handle valuable conversations

Documentation and guides

Getting started

Get a complete overview of the IONOS AI Model Hub and learn the first steps for successful integration and use.

API interface

Integrate and optimize the LLMs you need in your applications with the IONOS AI Model Hub API. The API documentation covers everything you need to know.

Enrich your apps

  • Register for the IONOS Public Cloud
  • Get full access to the IONOS AI Model Hub API
  • Connect powerful AI models to your application

The IONOS Cloud in practice

Powerful and future-proof cloud infrastructure that many customers trust.

A variety of AI models with endless possibilities

Discover the advantages of our open-source-based AI solutions.

Generate high-quality content in seconds with an open-source language model from our AI Model Hub.

  • Text that reads as if written by a native speaker
  • Creative writing and storytelling
  • Code generation

Reduce lengthy articles, paragraphs, or documents to their essential points with LLMs.

  • Summaries
  • Abstracts
  • Briefs

Use LLMs to select similar texts based on content or context and categorize them effectively.

  • Sentiment analysis
  • Topic labeling
  • Tagging content

With the AI Model Hub, you can choose relevant texts based on predefined criteria and fine-tune the accuracy of search results.

  • Implement a relevance ranking
  • Enhanced query understanding with LLMs

Use a vector database to store and process text elements more efficiently than with SQL or NoSQL databases.

  • Identify relevant content for queries and searches
  • Provide content to an LLM for processing

Benefit from precise and customized image generation with AI, creating unique visuals in no time.

  • Convert existing texts into visual representations
  • Create images based on text inputs
Questions about the IONOS Cloud?
Sales support
1-866-991-2631

Our product experts are here from 9:00am–5:00pm, Monday to Friday.

Technical support
267-481-7981

We're standing by to help, 24/7.