HeatWave User Guide  /  ...  /  Supported Languages, Embedding Models, and LLMs

4.2.2 Supported Languages, Embedding Models, and LLMs

This section provides the list of languages that HeatWave GenAI supports and the embedding models as well as large language models (LLMs) that are available in HeatWave.

Languages

As of MySQL 9.0.0, HeatWave GenAI supports natural-language communication, ingesting documents, as well as generating text-based content in English (en).

As of MySQL 9.0.1-u1, HeatWave GenAI supports natural-language communication, ingesting documents, as well as generating text-based content in multiple languages. The quality of the generated text outputs depends on the training and ability of the LLM to work with the language.

Following is a combined list of languages supported by HeatWave In-Database LLMs and OCI Generative AI Service LLMs:

  • Arabic (ar)

  • Bengali (bn)

  • Burmese (my)

  • Chinese (zh)

  • Czech (cs)

  • Dutch (nl)

  • English (en)

  • French (fr)

  • German (de)

  • Hebrew (he)

  • Hindi (hi)

  • Indonesian (id)

  • Italian (it)

  • Japanese (ja)

  • Khmer (km)

  • Korean (ko)

  • Lao (lo)

  • Malay (ms)

  • Persian (fa)

  • Polish (pl)

  • Portuguese (pt)

  • Spanish (es)

  • Tagalog (tl)

  • Thai (th)

  • Turkish (tr)

  • Urdu (ur)

  • Vietnamese (vi)

However, not all LLMs support all the languages. To learn which LLM supports which language, see HeatWave In-Database LLMs and OCI Generative AI Service LLMs.

Note

To set the value of the language parameter in HeatWave GenAI routines that support this parameter, do not use the language name to specify the language. Use the two-letter ISO 639-1 code for the language instead. For example, to use French, use the ISO 639-1 code for French, which is fr.

HeatWave In-Database Embedding Models

The following in-database embedding models are available in HeatWave:

  • minilm:

    • As of MySQL 9.2.1, for encoding text or files in any supported language.

    • In previous versions of MySQL, for encoding text or files in English only.

    HeatWave GenAI uses this embedding model, by default, for encoding English documents. all_minilm_l12_v2 is an alternative ID for the same model.

  • multilingual-e5-small:

    • As of MySQL 9.2.1, for encoding text or files in any supported language.

    • In previous versions of MySQL, for encoding text or files in supported languages other than English.

    HeatWave GenAI uses this embedding model, by default, for encoding documents in all supported languages other than English.

    This embedding model is available in MySQL 9.0.1-u1 and later versions.

OCI Generative AI Service Embedding Models

As of MySQL 9.0.1-u1, Oracle Cloud Infrastructure (OCI) Generative AI service embedding models are available in HeatWave.

Note

These models are available only in the regions supported by the OCI Generative AI service. For more information, see Pretrained Foundational Models in Generative AI.

However, to use the OCI Generative AI service with HeatWave GenAI, you need to enable the database system to access OCI services. For more information, see Section 4.2.3, “Authenticating OCI Generative AI Service”.

The following OCI Generative AI service embedding models are available in HeatWave:

  • cohere.embed-english-v3.0:

    • As of MySQL 9.2.1, for encoding text or files in any supported language.

    • In previous versions of MySQL, for encoding text or files in English only.

  • cohere.embed-multilingual-v3.0:

    • As of MySQL 9.2.1, for encoding text or files in any supported language.

    • In previous versions of MySQL, for encoding text or files in supported languages other than English.

HeatWave In-Database LLMs

The following in-database LLMs are available in HeatWave:

  • mistral-7b-instruct-v1

    • As of MySQL 9.0.0, supports English.

    • As of MySQL 9.0.1-u1, supports the following languages:

      • English (en)

      • French (fr)

      • German (de)

      • Spanish (es)

  • llama2-7b-v1

    • As of MySQL 9.0.0, supports English (en).

    • As of MySQL 9.0.1-u1, the support for llama2-7b-v1 has been deprecated.

  • llama3-8b-instruct-v1

    • This LLM is available in MySQL 9.0.1-u1 and later versions.

    • It supports the following languages:

      • English (en)

      • French (fr)

      • German (de)

      • Hindi (hi)

      • Italian (it)

      • Portuguese (pt)

      • Spanish (es)

      • Thai (th)

OCI Generative AI Service LLMs

HeatWave GenAI lets you use OCI Generative AI service LLMs, which support all available Languages.

However, to use the OCI Generative AI service with HeatWave GenAI, you need to enable the database system to access OCI services. For more information, see Section 4.2.3, “Authenticating OCI Generative AI Service”.

Note

These LLMs are available only in the regions supported by the OCI Generative AI service. For more information, see Pretrained Foundational Models in Generative AI.

As of MySQL 9.1.2, the following OCI Generative AI service LLMs are available in HeatWave:

  • cohere.command-r-08-2024

  • cohere.command-r-plus-08-2024

  • meta.llama-3.1-70b-instruct

  • meta.llama-3.1-405b-instruct

To learn about deprecation and removal times of OCI Generative AI service LLMs, see Retiring the Models .