Image input

POST

https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent

The Gemini API supports multimodal inputs that combine text and media files. The following example shows how to generate text from text and image input

Request Example

Shell

JavaScript

Java

Swift

curl --location -g --request POST 'https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=' \
--header 'Content-Type: application/json' \
--data-raw '@$TEMP_JSON'

Response Example

{}

Request

Query Params

key

string

required

Example:

Header Params

Content-Type

string

required

Example:

application/json

Body Params application/json

contents

array [object {1}]

required

parts

array [object {2}]

required

Examples

Responses

🟢200成功

application/json

Body

object {0}

Text input

Streaming output