feat: add code samples for tuning with intermediate checkpoints #13366

yishan-pu · 2025-05-13T23:45:18Z

Description

Fixes #

Note: Before submitting a pull request, please open an issue for discussion if you are not associated with Google.

Checklist

I have followed Sample Guidelines from AUTHORING_GUIDE.MD
README is updated to include all relevant information
Tests pass: nox -s py-3.9 (see Test Environment Setup)
Lint pass: nox -s lint (see Test Environment Setup)
These samples need a new API enabled in testing projects to pass (let us know which ones)
These samples need a new/updated env vars in testing projects set to pass (let us know which ones)
This sample adds a new sample directory, and I updated the CODEOWNERS file with the codeowners for this sample
This sample adds a new Product API, and I updated the Blunderbuss issue/PR auto-assigner with the codeowners for this sample
Please merge this PR for me once it is approved

snippet-bot · 2025-05-13T23:45:27Z

Here is the summary of changes.

You are about to add 5 region tags.

genai/tuning/tuning_with_checkpoints_create.py:17, tag googlegenaisdk_tuning_with_checkpoints_create
genai/tuning/tuning_with_checkpoints_get_model.py:17, tag googlegenaisdk_tuning_with_checkpoints_get_model
genai/tuning/tuning_with_checkpoints_list_checkpoints.py:17, tag googlegenaisdk_tuning_with_checkpoints_list_checkpoints
genai/tuning/tuning_with_checkpoints_set_default_checkpoint.py:17, tag googlegenaisdk_tuning_with_checkpoints_set_default
genai/tuning/tuning_with_checkpoints_textgen_with_txt.py:17, tag googlegenaisdk_tuning_with_checkpoints_test

This comment is generated by snippet-bot.
If you find problems with this result, please file an issue at:
https://github.com/googleapis/repo-automation-bots/issues.
To update this comment, add snippet-bot:force-run label or use the checkbox below:

Refresh this comment

gemini-code-assist

Hello @yishan-pu, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

Summary of Changes

This pull request introduces code samples for tuning models with intermediate checkpoints using the google-cloud-ai-generative library. It adds new files demonstrating how to create a tuning job that exports intermediate checkpoints, retrieve a tuned model with checkpoints, list checkpoints, set a default checkpoint, and test a model against different checkpoints. Additionally, it modifies an existing file to print checkpoint information.

Highlights

New Samples: Adds new code samples for creating tuning jobs with checkpoints, retrieving tuned models, listing checkpoints, setting default checkpoints, and testing models against checkpoints.
Checkpoint Handling: Demonstrates how to access and utilize intermediate checkpoints during model tuning.
API Usage: Showcases the usage of google-cloud-ai-generative library for tuning jobs and model management.

Changelog

Click here to see the changelog

genai/tuning/tuning_job_create.py
- Added code to print checkpoint information if checkpoints are available in the tuning job.
genai/tuning/tuning_with_checkpoints_create.py
- Created a new sample demonstrating how to create a tuning job that exports intermediate checkpoints.
- Includes code to monitor the tuning job's state and print model, endpoint, and experiment information.
- Prints checkpoint details if available.
genai/tuning/tuning_with_checkpoints_get_model.py
- Created a new sample demonstrating how to retrieve a tuned model with checkpoint information.
- Prints the default checkpoint ID and details of all available checkpoints.
genai/tuning/tuning_with_checkpoints_list_checkpoints.py
- Created a new sample demonstrating how to list the checkpoints associated with a tuning job.
- Prints details of each checkpoint.
genai/tuning/tuning_with_checkpoints_set_default_checkpoint.py
- Created a new sample demonstrating how to set the default checkpoint for a tuned model.
- Updates the model configuration with the new default checkpoint ID.
genai/tuning/tuning_with_checkpoints_textgen_with_txt.py
- Created a new sample demonstrating how to test a tuned model against different checkpoints.
- Generates content using the default checkpoint and specific intermediate checkpoints.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point in your pull request via creating an issue comment (i.e. comment on the pull request page) using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in issue comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist is currently in preview and may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments to provide feedback.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Checkpoints mark the way,
Through tuning's winding maze,
Models learn and grow.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

The pull request introduces code samples for tuning with intermediate checkpoints, which is a valuable addition to the documentation. The new files demonstrate how to create tuning jobs with checkpoints, retrieve tuned models, list checkpoints, set default checkpoints, and test checkpoints. Overall, the code is well-structured and easy to understand. However, there are a few areas that could be improved for clarity and efficiency.

Summary of Findings

Code Duplication: The code blocks for printing checkpoint information are duplicated in tuning_job_create.py and tuning_with_checkpoints_create.py. Consider refactoring this into a shared function or utility.
Error Handling: The input() calls in the if __name__ == "__main__" blocks lack error handling. Add error handling to gracefully handle invalid user input.
Hardcoded Indices: The test_checkpoint function uses hardcoded indices to access checkpoints. Consider iterating through the checkpoints dynamically to make the code more flexible.

Merge Readiness

The pull request introduces valuable code samples for tuning with intermediate checkpoints. However, there are a few areas that could be improved for clarity, efficiency, and robustness. I recommend addressing the code duplication, error handling, and hardcoded indices issues before merging. I am unable to approve this pull request, and recommend that others review and approve this code before merging.

gemini-code-assist · 2025-05-13T23:45:53Z

genai/tuning/tuning_with_checkpoints_create.py

+    while tuning_job.state in running_states:
+        print(tuning_job.state)
+        tuning_job = client.tunings.get(name=tuning_job.name)
+        time.sleep(60)


The time.sleep(60) call could be interrupted by a signal. Consider using a more robust approach to waiting for the tuning job to complete, such as polling with a timeout or using a dedicated event loop.

genai/tuning/tuning_with_checkpoints_get_model.py

genai/tuning/tuning_with_checkpoints_list_checkpoints.py

genai/tuning/tuning_with_checkpoints_set_default_checkpoint.py

gemini-code-assist · 2025-05-13T23:45:53Z

genai/tuning/tuning_with_checkpoints_textgen_with_txt.py

+    tuning_job_name = input("Tuning job name: ")
+    test_checkpoint(tuning_job_name)


Consider adding error handling to the input() calls. If the user enters invalid input, the program will crash. It would be more robust to handle potential exceptions and provide informative error messages.

…-samples into checkpoint-samples

msampathkumar · 2025-05-14T16:35:25Z

genai/tuning/tuning_with_checkpoints_create.py

+        training_dataset="gs://cloud-samples-data/ai-platform/generative_ai/gemini-2_0/text/sft_train_data.jsonl",
+        config=CreateTuningJobConfig(
+            tuned_model_display_name="Example tuning job",
+            # Set to True to disable tuning intermediate checkpoints. Default is False.


3 issues

Little confusing words. Try something like

# Set `export_last_checkpoint_only` to False, to create intermediate checkpoints.

Instead of export_last_checkpoint_only, add_intermediate_checkpoints could be a better word choice.

The default value is None. https://github.com/googleapis/python-genai/blob/a3fc532594eff8f01749f6275c506f7516e8ab73/google/genai/types.py#L6890

Hi Sampath, export_last_checkpoint_only is the variable name defined by the Gen AI SDK, and aligns with the API and the UI.

msampathkumar · 2025-05-14T16:36:57Z

genai/tuning/tuning_with_checkpoints_create.py

+
+    tuning_job = client.tunings.tune(
+        base_model="gemini-2.0-flash-lite-001",
+        training_dataset="gs://cloud-samples-data/ai-platform/generative_ai/gemini-2_0/text/sft_train_data.jsonl",


Do not use gemini-2_0 (model family version or model versions) in the file name.

You can use gemini_sft or gemini_flash_sft

Thanks for the feedback! I'll update the filename to genai_flash_sft to better reflect the content.

msampathkumar · 2025-05-14T16:40:27Z

genai/tuning/tuning_with_checkpoints_get_model.py

+
+    # Get the tuning job and the tuned model.
+    # Eg. name = "projects/123456789012/locations/us-central1/tuningJobs/123456789012345"
+    tuning_job = client.tunings.get(name=name)


Kindly do not use Generic variable names like name. This is too difficult to understand.

Use something like job_name or tuning_job_id gives an idea of what is name

msampathkumar · 2025-05-14T16:41:43Z

genai/tuning/tuning_with_checkpoints_set_default_checkpoint.py

+# limitations under the License.
+
+
+def set_default_checkpoint(name: str, checkpoint_id: str) -> str:


Kindly do not use Generic variable names like name. This is too difficult to understand.

Use something like job_name or tuning_job_id gives an idea of what is name

msampathkumar · 2025-05-14T16:43:10Z

genai/tuning/tuning_with_checkpoints_textgen_with_txt.py

+# limitations under the License.
+
+
+def test_checkpoint(name: str) -> str:


Kindly do not use Generic variable names like name. This is too difficult to understand. Use something like job_name or tuning_job_id gives an idea of what is name

test is reserved word! Do not use test as prefix or suffix

msampathkumar · 2025-05-14T16:45:05Z

genai/tuning/tuning_with_checkpoints_textgen_with_txt.py

+        model=tuning_job.tuned_model.endpoint,
+        contents=contents,
+    )
+    print(response.text)


Missing example response

msampathkumar · 2025-05-14T16:45:24Z

genai/tuning/tuning_with_checkpoints_textgen_with_txt.py

+
+    contents = "Why is the sky blue?"
+
+    # Tests the default checkpoint


How is this a test?

msampathkumar · 2025-05-14T16:45:30Z

genai/tuning/tuning_with_checkpoints_textgen_with_txt.py

+    )
+    print(response.text)
+
+    # Tests Checkpoint 1


How is this a test?

msampathkumar · 2025-05-14T16:45:38Z

genai/tuning/tuning_with_checkpoints_textgen_with_txt.py

+    )
+    print(checkpoint1_response.text)
+
+    # Tests Checkpoint 2


How is this a test?

msampathkumar · 2025-05-14T16:45:45Z

genai/tuning/tuning_with_checkpoints_textgen_with_txt.py

+        model=tuning_job.tuned_model.checkpoints[0].endpoint,
+        contents=contents,
+    )
+    print(checkpoint1_response.text)


Missing example response

msampathkumar · 2025-05-14T16:45:49Z

genai/tuning/tuning_with_checkpoints_textgen_with_txt.py

+        model=tuning_job.tuned_model.checkpoints[1].endpoint,
+        contents=contents,
+    )
+    print(checkpoint2_response.text)


Missing example response

glasnt · 2025-05-14T22:56:24Z

Hi @msampathkumar, it looks like your review was on a merged PR. You may need to open a new PR with these suggested changes.

feat: add code samples for tuning with intermediate checkpoints

b8718be

yishan-pu requested review from a team as code owners May 13, 2025 23:45

product-auto-label bot added the samples Issues that are directly related to samples. label May 13, 2025

gemini-code-assist bot reviewed May 13, 2025

View reviewed changes

blunderbuss-gcf bot assigned glasnt May 13, 2025

gemini-code-assist bot suggested changes May 13, 2025

View reviewed changes

Yishan Pu added 2 commits May 13, 2025 16:46

feat: add code samples for tuning with intermediate checkpoints

bd6e7c8

Merge branch 'checkpoint-samples' of github.com:yishan-pu/python-docs…

f6c7900

…-samples into checkpoint-samples

glasnt approved these changes May 14, 2025

View reviewed changes

glasnt merged commit 286361a into GoogleCloudPlatform:main May 14, 2025
11 checks passed

msampathkumar reviewed May 14, 2025

View reviewed changes

genai/tuning/tuning_with_checkpoints_textgen_with_txt.py

)

print(response.text)

# Tests Checkpoint 1

Copy link

Member

msampathkumar May 14, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How is this a test?

msampathkumar reviewed May 14, 2025

View reviewed changes

glasnt assigned msampathkumar and unassigned glasnt May 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add code samples for tuning with intermediate checkpoints #13366

feat: add code samples for tuning with intermediate checkpoints #13366

yishan-pu commented May 13, 2025 •

edited

Loading

snippet-bot bot commented May 13, 2025 •

edited

Loading

gemini-code-assist bot left a comment

gemini-code-assist bot left a comment

gemini-code-assist bot May 13, 2025

gemini-code-assist bot May 13, 2025

msampathkumar May 14, 2025

yishan-pu May 14, 2025

msampathkumar May 14, 2025

gemini-code-assist bot May 14, 2025

msampathkumar May 14, 2025

msampathkumar May 14, 2025

msampathkumar May 14, 2025

msampathkumar May 14, 2025

msampathkumar May 14, 2025

msampathkumar May 14, 2025

msampathkumar May 14, 2025

msampathkumar May 14, 2025

msampathkumar May 14, 2025

glasnt commented May 14, 2025

		tuning_job_name = input("Tuning job name: ")
		test_checkpoint(tuning_job_name)

		# limitations under the License.


		def set_default_checkpoint(name: str, checkpoint_id: str) -> str:

		# limitations under the License.


		def test_checkpoint(name: str) -> str:


		contents = "Why is the sky blue?"

		# Tests the default checkpoint

feat: add code samples for tuning with intermediate checkpoints #13366

feat: add code samples for tuning with intermediate checkpoints #13366

Conversation

yishan-pu commented May 13, 2025 • edited Loading

Description

Checklist

snippet-bot bot commented May 13, 2025 • edited Loading

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Summary of Changes

Highlights

Changelog

Footnotes

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Summary of Findings

Merge Readiness

gemini-code-assist bot May 13, 2025

Choose a reason for hiding this comment

gemini-code-assist bot May 13, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

gemini-code-assist bot May 14, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

glasnt commented May 14, 2025

yishan-pu commented May 13, 2025 •

edited

Loading

snippet-bot bot commented May 13, 2025 •

edited

Loading