Description
System Info
Running with latest llamastack main branch
Information
- The official example scripts
- My own modified scripts
🐛 Describe the bug
The move to the OpenAI chat completions API (7e48cc4) broke requests because it sends a wrong max_output_tokens. The following BadRequestError is raised:
openai.BadRequestError: Error code: 400 - [{'error': {'code': 400, 'message': '* GenerateContentRequest.generation_config.max_output_tokens: max_output_tokens must be positive.\n', 'status': 'INVALID_ARGUMENT'}}]
Error logs
openai.BadRequestError: Error code: 400 - [{'error': {'code': 400, 'message': '* GenerateContentRequest.generation_config.max_output_tokens: max_output_tokens must be positive.\n', 'status': 'INVALID_ARGUMENT'}}]
Expected behavior
If max tokens is not specified in the llamastack config, it should simply pass None and allow the request to go through, as it did before the referenced commit.
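For illustration, a minimal sketch of the expected handling using the openai Python client. The model name and the wrapper function are placeholders, not llamastack internals; the point is that an unset max_tokens should be omitted from the request rather than forwarded as a non-positive integer:

```python
from openai import OpenAI, NOT_GIVEN

client = OpenAI()

def chat(messages, max_tokens=None):
    # Suspected current behavior: an unset max_tokens ends up being sent
    # as a non-positive value, which the backend rejects with
    # "max_output_tokens must be positive".
    #
    # Expected behavior: when max_tokens is None, leave the parameter
    # out entirely (NOT_GIVEN) so the backend applies its own default.
    return client.chat.completions.create(
        model="gemini-1.5-flash",  # placeholder model name
        messages=messages,
        max_tokens=max_tokens if max_tokens is not None else NOT_GIVEN,
    )
```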