Raise ValueError exception if bucket name is invalid. #3160

daspecster · 2017-03-16T20:04:09Z

dhermes · 2017-03-16T20:05:46Z

@daspecster I am 👎 on this change, not really sure what the right approach is but adding this check in the constructor makes EVERY constructed bucket pay the price

storage/google/cloud/storage/_helpers.py


    def __init__(self, name=None):
-        self.name = name
+        if name is None or (re.match(r'\w', name[0]) and


daspecster · 2017-03-16T20:07:13Z

@dhermes you mean every existing bucket? I don't believe there could be any existing buckets that violate this rule? I think the API would have complained.

#2956 (comment)

dhermes · 2017-03-16T20:09:23Z

@daspecster No I mean the local Bucket objects created by users of this library

tseaver · 2017-03-16T20:12:11Z

Quoting my follow up to #2956:

If the API allows for / in the bucket name, then we shouldn't be prohibiting such names: instead, we should be URL-escaping the bucket name when constructing the API request URL.
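As a rough illustration of the URL-escaping idea, here is a minimal sketch. The helper name `bucket_path` and the `/b/<name>` path layout are assumptions made for this example, not something confirmed in this thread; only `urllib.parse.quote` is a real standard-library call.

```python
from urllib.parse import quote


def bucket_path(name):
    """Build a request path with the bucket name percent-encoded.

    Hypothetical helper: the '/b/' prefix is assumed for illustration.
    """
    # safe='' makes quote() escape '/' as well, so a slash inside the
    # name cannot be mistaken for a path separator.
    return '/b/' + quote(name, safe='')


print(bucket_path('my-bucket'))   # /b/my-bucket
print(bucket_path('my-bucket/'))  # /b/my-bucket%2F
```

Escaping only changes how the name travels in the URL; it does not by itself decide whether the API accepts the name.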

daspecster · 2017-03-16T20:12:35Z

@lukesneeringer @tseaver any alternative ideas?

ISTM that object creation would be the place to check this? Maybe there's a more efficient way to do it though?

dhermes · 2017-03-16T20:13:18Z

I think we can "fix" by just more clearly documenting the acceptable values.

tseaver · 2017-03-16T20:15:53Z

@dhermes If the API allows embedding / in a bucket name, then we have to honor that, which means that we need to be URL-escaping the bucket name when creating API URLs

daspecster · 2017-03-16T20:16:22Z

@tseaver a user might be confused when they see their bucket names listed from some other library that may or may not handle the URL encoding/decoding, right?

@dhermes sure, but the error that is returned is a 404 which doesn't explain what the allowed values are.
I'll update the docs to be more clear about the allowable name values since that should be in here anyway.

lukesneeringer · 2017-03-16T20:40:05Z

@tseaver, you are making an orthogonal point; in actuality, slashes are not permissible at the start or end of a bucket name (by the API), but the client library allows them to pass through.

tseaver · 2017-03-16T20:43:12Z

@lukesneeringer the OP in #2956 reported having created a bucket with a trailing slash.

lukesneeringer · 2017-03-16T20:45:51Z

@tseaver Yes, in our library, and then the API dropped the slash for the bucket name.

lukesneeringer · 2017-03-16T20:48:20Z

Quoting @dhermes:

@daspecster I am 👎 on this change, not really sure what the right approach is but adding this check in the constructor makes EVERY constructed bucket pay the price

That seems fine to me. It is not terribly expensive.

lukesneeringer

Approved, pending @dhermes explaining his problem with it and resolving that.

dhermes · 2017-03-17T18:29:09Z

My issue is just that RegEx is expensive. In terms of this specific approach, we don't even need a regex:

name[0].isalnum()
name[-1].isalnum()

Though this is a little strange with unicode (Python 2) / str (Python 3)

>>> u'\xff'.isalnum()
True
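To make the difference between the checks concrete, the following sketch compares `re.match(r'\w', ...)`, `str.isalnum()`, and an ASCII-only membership test on a few characters under Python 3. The helper `is_ascii_alnum` is hypothetical and not part of the PR; it only shows one way to sidestep the non-ASCII case, assuming ASCII-only names are what is wanted.

```python
import re
import string

# Hypothetical ASCII-only alternative to str.isalnum().
_ASCII_ALNUM = frozenset(string.ascii_letters + string.digits)


def is_ascii_alnum(ch):
    """Return True only for ASCII letters and digits."""
    return ch in _ASCII_ALNUM


# Columns: character, regex \w, str.isalnum(), ASCII-only check.
for ch in ('a', '7', '_', '/', '\xff'):
    print(repr(ch), bool(re.match(r'\w', ch)), ch.isalnum(), is_ascii_alnum(ch))
```

Two things stand out: `\w` also matches an underscore, which `isalnum()` rejects, and both `\w` and `isalnum()` accept `'\xff'`, while the ASCII-only test does not.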

lukesneeringer · 2017-03-17T18:43:11Z

Sold. Change to that, and add len(name) <= 222.

EDIT: We should also add all([len(i) <= 63 for i in name.split('.')]).

That is good enough. The API can complain about the remaining errors (e.g. "not containing google or a common misspelling thereof").

lukesneeringer · 2017-03-17T18:52:08Z

@daspecster Suggestion:

class _PropertyMixin(object):
    def __init__(self, name=None):
        if name:
            self._validate_name(name)
        [...]

    def _validate_name(self, name):
        # Names must start and end with a letter or number.
        if not name[0].isalnum() or not name[-1].isalnum():
            raise ValueError('Bucket names must start and end with an alphanumeric character.')

        # Names must be between 3 and 222 total characters in length.
        if len(name) < 3:
            raise ValueError('Bucket names must be at least 3 characters.')
        if len(name) > 222:
            raise ValueError('Bucket names can not exceed 222 characters.')

        # Each bucket name component can not exceed 63 characters.
        if any([len(i) > 63 for i in name.split('.')]):
            raise ValueError('Each dot-separated component in a bucket name '
                             'can not exceed 63 characters.')
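To see how these checks behave, here is a standalone sketch that copies the same rules into a free function and runs a few names through it. The names `validate_bucket_name` and `check` are hypothetical, chosen for this example; the merged code may differ from this.

```python
def validate_bucket_name(name):
    """Standalone copy of the checks suggested above (hypothetical name)."""
    # Names must start and end with a letter or number.
    if not name[0].isalnum() or not name[-1].isalnum():
        raise ValueError(
            'Bucket names must start and end with an alphanumeric character.')

    # Names must be between 3 and 222 total characters in length.
    if len(name) < 3:
        raise ValueError('Bucket names must be at least 3 characters.')
    if len(name) > 222:
        raise ValueError('Bucket names can not exceed 222 characters.')

    # Each dot-separated component can not exceed 63 characters.
    if any(len(part) > 63 for part in name.split('.')):
        raise ValueError('Each dot-separated component in a bucket name '
                         'can not exceed 63 characters.')


def check(name):
    """Return the error message for a name, or None if it passes."""
    try:
        validate_bucket_name(name)
    except ValueError as exc:
        return str(exc)
    return None


for candidate in ('my-bucket', 'my-bucket/', 'ab', 'a' * 64):
    print(repr(candidate), '->', check(candidate) or 'ok')
```

Like the suggestion, this assumes a non-empty name; the constructor guards that with `if name:` before calling the validator.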

lukesneeringer

Updated based on previous discussion.

daspecster · 2017-03-17T18:53:29Z

@lukesneeringer where are you seeing the 63 char limit in the docs?

lukesneeringer · 2017-03-17T18:54:08Z

Bucket names must contain 3 to 63 characters. Names containing dots can contain up to 222 characters, but each dot-separated component can be no longer than 63 characters.

daspecster · 2017-03-17T18:54:49Z

@lukesneeringer found the link. Thanks!

dhermes · 2017-03-17T18:56:16Z

There is just so much complexity here. Why can't we just punt to the server validation and document the rules in our docstring?

daspecster · 2017-03-17T18:59:09Z

To echo @dhermes, there are more requirements for the names as well, and I assume they're subject to change (with or without bumping the API version).

Your bucket names must meet the following requirements:

Bucket names must contain only lowercase letters, numbers, dashes (-), underscores (_), and dots (.). Names containing dots require verification.
Bucket names must start and end with a number or letter.
Bucket names must contain 3 to 63 characters. Names containing dots can contain up to 222 characters, but each dot-separated component can be no longer than 63 characters.
Bucket names cannot be represented as an IP address in dotted-decimal notation (for example, 192.168.5.4).
Bucket names cannot begin with the "goog" prefix.
Bucket names cannot contain "google" or close misspellings of "google".
Also, for DNS compliance and future compatibility, you should not use underscores (_) or have a period adjacent to another period or dash. For example, ".." or "-." or ".-" are not valid in DNS names.
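To give a sense of how much extra code the remaining rules would take client-side, here is a hedged sketch of a few of them. The function `extra_name_problems` is hypothetical and not part of this PR; it uses only the standard-library `ipaddress` and `string` modules, and it makes no attempt at the "close misspellings" rule, which only the server can judge.

```python
import ipaddress
import string

# Characters allowed by the quoted rules: lowercase letters, digits, '-', '_', '.'.
_ALLOWED_CHARS = frozenset(string.ascii_lowercase + string.digits + '-_.')


def extra_name_problems(name):
    """Return a list of rule violations found in name (hypothetical helper)."""
    problems = []

    if not set(name) <= _ALLOWED_CHARS:
        problems.append('disallowed character')

    # IPv4Address raises a ValueError subclass for anything that is not
    # a dotted-decimal address.
    try:
        ipaddress.IPv4Address(name)
    except ValueError:
        pass
    else:
        problems.append('looks like an IP address')

    if name.startswith('goog'):
        problems.append('"goog" prefix')
    if 'google' in name:
        problems.append('contains "google"')

    return problems


for candidate in ('my-bucket', '192.168.5.4', 'goog-data', 'My_Bucket'):
    print(repr(candidate), extra_name_problems(candidate))
```

Every rule mirrored this way is one more place where the client can drift from the server if the requirements change.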

lukesneeringer · 2017-03-17T19:08:21Z

I guess what I was thinking was, "catch the easy ones, let the API fail on the rest".
The issue with the forward slash in particular is that it gets stripped by the API, which leads to non-obvious errors later. So, at minimum, we should catch the alphanumeric 0 and -1 indices.

I could go either way on the rest.

daspecster · 2017-03-20T17:45:43Z

@lukesneeringer did you have anything else for this?

Raise exception if bucket name is invalid.

4e73251

daspecster added the api: storage Issues related to the Cloud Storage API. label Mar 16, 2017

daspecster assigned lukesneeringer, tseaver and dhermes Mar 16, 2017

googlebot added the cla: yes This human has signed the Contributor License Agreement. label Mar 16, 2017

Update Bucket name docstring to define valid values.

8c47c93

lukesneeringer approved these changes Mar 17, 2017

lukesneeringer suggested changes Mar 17, 2017

Add _validate_name helper method.

79f107d

daspecster force-pushed the add-bucket-name-exception branch from 99ce2e0 to 79f107d Compare March 17, 2017 19:56

lukesneeringer approved these changes Mar 20, 2017

lukesneeringer merged commit 1b5e35c into googleapis:master Mar 20, 2017

daspecster deleted the add-bucket-name-exception branch March 20, 2017 18:05

richkadel pushed a commit to richkadel/google-cloud-python that referenced this pull request May 6, 2017

Raise ValueError exception if bucket name is invalid. (googleapis#3160)

d0c9174

theacodes unassigned dhermes Sep 28, 2018

parthea pushed a commit that referenced this pull request Oct 21, 2023

chore(deps): update dependency google-cloud-kms to v1.3.0 [(#3160)](G…

50ae2ed

…oogleCloudPlatform/python-docs-samples#3160)
