Add std::io::util #10895

sfackler · 2013-12-10T06:54:10Z

This adds a bunch of useful Reader and Writer implementations. I'm not a
huge fan of the name util but I can't think of a better name and I
don't want to make std::io any longer than it already is.

huonw · 2013-12-10T06:57:19Z

src/libstd/io/util.rs

+    }
+}
+
+impl Reader for ChainedReader {


Is there a particular reason this is not next to the ChainedReader declaration?

A typo :). Updated.

alexcrichton · 2013-12-10T07:05:17Z

This looks really good to me, nice work!

I'd like to get another pair or two of eyes on this though. I'm always a little wary of adding new util modules, but I agree that all of these are very useful utilities which we should have. The only other useful utility-style thing which I would want is a function to copy a reader into a writer. Other than that, this seems pretty complete to me (but others should weigh in).

One possible way to get rid of util would be to just reexport these under std::io directly.

sfackler · 2013-12-10T07:23:52Z

@alexcrichton I've added a copy function in. It's currently using the same uninitialized buffer trick that BufferedReader and BufferedWriter do. It does seem a bit dangerous to me since a Reader can lie about the amount of data it's given you and you'll end up looking at uninitialized memory by accident, but I don't know if that's something worth worrying about. It may be worth making an unsafe vec::bytes::with_size_uninitialized function since we're doing this in two places.

I'm not a huge fan of reexporting since it just seems to confuse the documentation.

sfackler · 2013-12-10T07:33:18Z

@alexcrichton updated.

alexcrichton · 2013-12-10T16:56:05Z

Another option I just thought of is a couple of newline-conversion utilities. Something like a reader -> reader converting newlines or a writer -> writer converting newlines (to \r\n or the other way).

bill-myers · 2013-12-10T18:01:41Z

ChainedReader should take an iterator instead of a vector.

Those who want to use a vector can then pass its move_iter(), which will result in something that behaves exactly like the current version, and it's also possible to pass a custom iterator that lazily creates the readers.

Plus, this way readers that are done with are destroyed immediately with no extra code rather than kept around unused.

For instance, this would allow to concatenate files named "file.000", "file.001", "file.002", etc. and opening and closing each file at a time, while the current code requires to have all of them open at once, which is worse and might even fail due to OS open file limits.

Also, there should probably be another version that takes only two readers, but with a concrete type put in a type parameter (like TeeWriter, except it takes two readers instead of a writer).

A NullReader/EmptyReader would complete the set.

LimitReader should probably take the reader by value instead of &mut just like TeeWriter does (or, if taking by &mut were deemed best, which I think would be a mistake, then TeeWriter should take by &mut).

sfackler · 2013-12-10T18:13:52Z

I'll look into switching ChainedReader over to taking an iterator.

LimitReader takes the reader by reference because (I think) the common use case will be something like

let mut r = some_reader;
let len = r.read_be_i32() as uint;
let some_struct = read_some_struct(LimitedReader::new(len, &mut r));
...

Where the reader is just loaned out temporarily. If LimitedReader took the reader by value, you'd need to add a couple of extra lines of wrapping and unwrapping.

bill-myers · 2013-12-10T18:18:12Z

&mut Reader implements Reader, so that code should continue working even if LimitedReader takes an <R: Reader> by value (in that code, we'd have R = &mut SomeReader, which implements Reader).

That's why I think passing by value (where the type is a generic argument) is the correct choice, since it is strictly more general.

In fact, I believe that ChainedReader should even be ChainedReader<R: Reader, I: Iterator> so that it can be passed any of Iterator, Iterator<&mut Reader> or Iterator<~Reader>.

sfackler · 2013-12-11T03:27:10Z

@bill-myers &mut Reader implements Reader but &mut R where R: Reader does not.

huonw · 2013-12-11T04:35:24Z

src/libstd/io/util.rs

+            match self.cur_reader {
+                Some(ref mut r) => {
+                    match r.read(buf) {
+                        Some(len) => return Some(len),


I feel like the behaviour of this is slightly peculiar, e.g., say you had two 100 byte files, it would take two calls to this .read to fill a [u8, .. 200], right?

Shouldn't this function be attempting to fill the buffer?

That doesn't seem safe. What happens if the first read returns valid data and the second one raises an error? I don't think it'd be good to throw away the data that it was able to read. read_to_end exists if you explicitly want to do that, and another method on Reader that repeatedly read to fill the buffer could be added as well.

Ah, true enough.

This adds a bunch of useful Reader and Writer implementations. I'm not a huge fan of the name `util` but I can't think of a better name and I don't want to make `std::io` any longer than it already is.

huonw reviewed Dec 10, 2013
View reviewed changes

huonw reviewed Dec 11, 2013
View reviewed changes

Add std::io::util

7fe5e30

This adds a bunch of useful Reader and Writer implementations. I'm not a huge fan of the name `util` but I can't think of a better name and I don't want to make `std::io` any longer than it already is.

bors closed this Dec 13, 2013

bors merged commit 7fe5e30 into rust-lang:master Dec 13, 2013

sfackler deleted the io-util branch December 23, 2013 03:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add std::io::util #10895

Add std::io::util #10895

Uh oh!

sfackler commented Dec 10, 2013

Uh oh!

huonw Dec 10, 2013

Uh oh!

sfackler Dec 10, 2013

Uh oh!

alexcrichton commented Dec 10, 2013

Uh oh!

sfackler commented Dec 10, 2013

Uh oh!

sfackler commented Dec 10, 2013

Uh oh!

alexcrichton commented Dec 10, 2013

Uh oh!

bill-myers commented Dec 10, 2013

Uh oh!

sfackler commented Dec 10, 2013

Uh oh!

bill-myers commented Dec 10, 2013

Uh oh!

sfackler commented Dec 11, 2013

Uh oh!

huonw Dec 11, 2013

Uh oh!

sfackler Dec 11, 2013

Uh oh!

huonw Dec 11, 2013

Uh oh!

Uh oh!

Add std::io::util #10895

Add std::io::util #10895

Uh oh!

Conversation

sfackler commented Dec 10, 2013

Uh oh!

huonw Dec 10, 2013

Choose a reason for hiding this comment

Uh oh!

sfackler Dec 10, 2013

Choose a reason for hiding this comment

Uh oh!

alexcrichton commented Dec 10, 2013

Uh oh!

sfackler commented Dec 10, 2013

Uh oh!

sfackler commented Dec 10, 2013

Uh oh!

alexcrichton commented Dec 10, 2013

Uh oh!

bill-myers commented Dec 10, 2013

Uh oh!

sfackler commented Dec 10, 2013

Uh oh!

bill-myers commented Dec 10, 2013

Uh oh!

sfackler commented Dec 11, 2013

Uh oh!

huonw Dec 11, 2013

Choose a reason for hiding this comment

Uh oh!

sfackler Dec 11, 2013

Choose a reason for hiding this comment

Uh oh!

huonw Dec 11, 2013

Choose a reason for hiding this comment

Uh oh!

Uh oh!