Description
We’ve encountered a reproducible issue when using the Java Vertex AI client (com.google.cloud:google-cloud-vertexai:1.18.0) with Gemini Flash models for structured text generation.
When a responseSchema is attached to the GenerationConfig, the model intermittently produces malformed or repetitive JSON outputs, often looping text fragments or inserting stray newline escape sequences until the max output token limit is reached.
Removing the schema entirely eliminates the issue, and the same prompt setup works correctly in the Python Vertex AI SDK, suggesting this may be SDK-specific or related to how the Java client serializes the schema.
Environment details
| Key | Value |
|---|---|
| API | Vertex AI Generative AI (Java) |
| Library | com.google.cloud:google-cloud-vertexai:1.18.0 |
| Java version | 21 |
| OS | Windows 11 |
| Models tested | Gemini 2.0 Flash, Gemini 2.5 Flash Lite, Gemini 2.5 Flash |
| Behavior | Issue occurs with 2.0 Flash and 2.5 Flash Lite; 2.5 Flash mitigates it partially |
Steps to reproduce
- Configure a GenerativeModel with deterministic decoding:
  - temperature = 0.0f, topP = 0.0f, topK = 1, candidateCount = 1, seed = 42
  - responseMimeType = "application/json"
- Attach a complex responseSchema describing nested arrays and objects (see the example below).
- Send a document-extraction prompt requesting structured JSON per the schema.
- Observe that:
  - The model often ignores the schema’s structure.
  - Output becomes recursive or repetitive ("Company Company Company...").
  - Output terminates abruptly at the token limit with unclosed quotes or brackets.
- Remove the schema (keep all other settings identical).
- Observe that the output is now clean and well-formed JSON.
Code snippet (simplified)
GenerationConfig cfg = GenerationConfig.newBuilder()
    .setTemperature(0.0f)
    .setTopP(0.0f)
    .setTopK(1)
    .setCandidateCount(1)
    .setSeed(42)
    .setResponseMimeType("application/json")
    .setResponseSchema(ResponseSchemaFactory.getExtractionSchema()) // When set, issue occurs
    .build();

GenerativeModel model = baseModel
    .withSystemInstruction(ContentMaker.fromString(systemPrompt))
    .withGenerationConfig(cfg);

GenerateContentResponse response = model.generateContent(promptText);
String jsonOutput = ResponseHandler.getText(response); // often malformed
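For the control run in the last two reproduction steps, the only change is dropping the schema from the same builder. A minimal sketch (cfgNoSchema and controlResponse are illustrative names; everything else matches the setup above):

// Identical decoding settings, but no responseSchema attached (control run).
GenerationConfig cfgNoSchema = GenerationConfig.newBuilder()
    .setTemperature(0.0f)
    .setTopP(0.0f)
    .setTopK(1)
    .setCandidateCount(1)
    .setSeed(42)
    .setResponseMimeType("application/json")
    .build();

// Same system instruction and prompt; with this config the output parses cleanly.
GenerateContentResponse controlResponse = baseModel
    .withSystemInstruction(ContentMaker.fromString(systemPrompt))
    .withGenerationConfig(cfgNoSchema)
    .generateContent(promptText);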
Example schema shape:
Schema workExperience = Schema.newBuilder()
.setType(Type.OBJECT)
.putProperties("company", Schema.newBuilder().setType(Type.STRING).build())
.putProperties("tenure", Schema.newBuilder().setType(Type.STRING).build())
.putProperties("skills", Schema.newBuilder()
.setType(Type.ARRAY)
.setItems(Schema.newBuilder().setType(Type.STRING).build())
.build())
.build();
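For completeness, the root schema returned by ResponseSchemaFactory.getExtractionSchema() wraps the object above into nested structures. The exact factory code is omitted here; the following is an illustrative sketch whose field names mirror the observed output below, not the verbatim implementation:

// Illustrative root schema shape (not the verbatim factory code):
// an "experience" array of the workExperience objects above plus a "summary" string.
Schema extractionSchema = Schema.newBuilder()
    .setType(Type.OBJECT)
    .putProperties("experience", Schema.newBuilder()
        .setType(Type.ARRAY)
        .setItems(workExperience)
        .build())
    .putProperties("summary", Schema.newBuilder().setType(Type.STRING).build())
    .build();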
Observed output (excerpt, simulated)
{
"experience": [
{
"company": "TechCorp TechCorp TechCorp TechCorp TechCorp ...",
"tenure": "2 yrs",
"skills": ["Java", "Spring Boot"]
}
],
"summary": "\n\n {\n.\n.\n\\n\\n\\n\\n\\n\\n\\n\\n\n"
}
Occasionally, the output fails JSON parsing due to missing closing quotes or brackets:
com.fasterxml.jackson.core.io.JsonEOFException: Unexpected end-of-input:
was expecting closing quote for a string value
at [Source: (String)"{ "experience": [ { "company": "ABC
"tenure": "3 yrs"...]; line: 1, column: 4211]
Expected behavior
When responseSchema is provided, the model should consistently honor the schema and produce syntactically valid JSON following the defined structure.
Additional context
- Removing the schema entirely fixes the problem.
- Using identical prompts and schema definitions in the Python Vertex AI SDK does not reproduce the issue.
- Switching to Gemini 2.5 Flash improves output stability, possibly due to an increased reasoning or token budget.
- This suggests the issue may lie in schema serialization or in how the Java SDK encodes the request payload (see the serialization dump sketch below).
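To help compare payloads, one diagnostic we can run on the Java side is dumping the request protos to JSON with protobuf's JsonFormat (assuming protobuf-java-util is on the classpath; cfgJson is an illustrative name). A minimal sketch:

import com.google.protobuf.util.JsonFormat;

// Dump the GenerationConfig (including the attached Schema) as proto JSON to see
// exactly which schema fields the Java client sets, for comparison with the Python SDK.
String cfgJson = JsonFormat.printer().print(cfg); // throws InvalidProtocolBufferException
System.out.println(cfgJson);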
Would appreciate guidance on whether this is:
- A known limitation or bug in the Java Vertex AI client,
- A misalignment between the Java SDK’s schema format and backend expectations,
- Or a potential model-side behavior that needs handling guidance.