zsxkib/jina-clip-v2

Jina-CLIP v2: 0.9B multimodal embedding model with 89-language multilingual support, 512x512 image resolution, and Matryoshka representations

Input
Configure the inputs for the AI model.

Text content to embed (up to 8192 tokens). If both text and image provided, text embedding will be first in returned list.

Image file to embed (optimal size: 512x512). If both text and image provided, image embedding will be second in returned list.

64
1024

Matryoshka dimension - output embedding dimension (64-1024)

Format to use in outputs

Output
The generated output will appear here.

No output yet

Click "Generate" to create an output.

jina-clip-v2 - ikalos.ai