Conversation

@akarnokd (Member)

After watching Ben's great JavaOne 2014 talk, I was wondering why there isn't an onBackpressureBlock, i.e., a strategy that blocks the producer thread.

On one hand, I can accept that we don't want to encourage blocking operations; on the other hand, this operator would allow casual Observable.create() implementations to not worry about backpressure, at the expense of blocking one Schedulers.io() thread.

@jbripley (Contributor)

I definitely think this is useful.

What I'd also like are some helpers for implementing your own backpressure-aware Observable. We've made an internal wrapper class for blocking backpressure Observables, similar to this, but we're not sure it's the best approach.

@akarnokd (Member, Author)

If you can express your computation in Iterable form, you get backpressure for free. In one of the earlier bug reports, the person used Guava's AbstractIterator, where you only need to implement a single method that calculates the next value or signals completion.
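For illustration, here is a minimal sketch of that approach, assuming RxJava 1.x's Observable.from(Iterable) and Guava's AbstractIterator; the generator logic itself is made up:

import com.google.common.collect.AbstractIterator;

import rx.Observable;

public class IterableBackpressure {
    public static void main(String[] args) {
        // An Iterable whose iterator computes one value per call to computeNext()
        Iterable<Integer> squares = () -> new AbstractIterator<Integer>() {
            int i;
            @Override
            protected Integer computeNext() {
                if (i == 10) {
                    return endOfData(); // signals completion
                }
                int v = i * i;
                i++;
                return v;
            }
        };

        // Observable.from(Iterable) pulls from the iterator on demand,
        // so downstream requests translate directly into computeNext() calls.
        Observable.from(squares).take(5).subscribe(System.out::println);
    }
}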

@davidmoten (Collaborator)

Great point, David; it would be good to have that suggestion in the wiki.

@benjchristensen (Member)

Would it be a good idea to fail (emit onError) if someone tries this while operating on the computation scheduler?

benjchristensen added this to the 1.1 milestone Nov 29, 2014
@akarnokd (Member, Author)

Only if we could reliably detect the computation scheduler. But even if we could, what if the source emission jumps between the IO scheduler and the computation scheduler due to serialization, and the scheduler is detected only after some time?

@benjchristensen (Member)

"we could reliably detect the computation scheduler"

That's easy enough to do.

"what if the source emission jumps between the IO scheduler and the Computation"

I don't understand what role this plays. If the blocking is attempted on the IO Scheduler, that's fine. If it's attempted on the Computation Scheduler, that's not okay. Blocking on the IO Scheduler would not cause the Computation thread to block, since observeOn has a queue between the threads.

@akarnokd (Member, Author)

I guess you mean, for example, Thread.currentThread().getName().startsWith("RxComputation").
This has to be done on every onNext call, which adds overhead to every event, regardless of whether it would block the producer. I'd instead simply document the danger of using the operator on a stream observed on or subscribed on the computation scheduler.
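For illustration, such a name-based guard might look like the sketch below; the thread names and the choice of exception are my assumptions, not code from this PR:

public class ComputationThreadGuard {
    // hypothetical per-event check: reject blocking on computation threads
    static void checkNotComputationThread() {
        if (Thread.currentThread().getName().startsWith("RxComputation")) {
            throw new IllegalStateException(
                    "onBackpressureBlock must not block a computation thread");
        }
    }

    public static void main(String[] args) throws InterruptedException {
        Runnable task = () -> {
            try {
                checkNotComputationThread();
                System.out.println("ok on " + Thread.currentThread().getName());
            } catch (IllegalStateException e) {
                System.out.println("rejected on " + Thread.currentThread().getName());
            }
        };
        // illustrative thread names standing in for scheduler threads
        Thread io = new Thread(task, "RxIoScheduler-1");
        io.start();
        io.join();
        Thread computation = new Thread(task, "RxComputationThreadPool-1");
        computation.start();
        computation.join();
    }
}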

@benjchristensen (Member)

It can be done more easily and cheaply with a simple ThreadLocal boolean and a static check. We want this anyway so we stop randomly scheduling and bouncing between threads; that's an optimization I'll be adding soon. Netty does something similar and it's a big improvement.
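A sketch of that ThreadLocal-flag idea, with hypothetical names: the scheduler would clear the flag once when it creates a thread, so the per-event cost is a single boolean lookup.

public class BlockingAllowed {
    // true by default; a computation scheduler would set it to false
    // once per thread it creates
    static final ThreadLocal<Boolean> BLOCKING_ALLOWED =
            ThreadLocal.withInitial(() -> Boolean.TRUE);

    static void checkBlockingAllowed() {
        if (!BLOCKING_ALLOWED.get()) {
            throw new IllegalStateException("blocking not allowed on this thread");
        }
    }

    public static void main(String[] args) throws InterruptedException {
        Thread computation = new Thread(() -> {
            BLOCKING_ALLOWED.set(false); // done by the scheduler at thread start
            try {
                checkBlockingAllowed();
            } catch (IllegalStateException e) {
                System.out.println("rejected: " + e.getMessage());
            }
        });
        computation.start();
        computation.join();

        checkBlockingAllowed(); // the main thread may block
        System.out.println("main thread may block");
    }
}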

@akarnokd (Member, Author)

When I was experimenting with a balancing computation scheduler, I tried this ThreadLocal approach, but its internal HashMap lookup takes some time. However, since we already provide custom threads to the executor pool, we can identify the threads by their class after Thread.currentThread().

@benjchristensen (Member)

What per-operation impact did you measure?

@benjchristensen (Member)

Here is what Netty does; it may be worth looking at if we want to pursue this route: https://github.com/netty/netty/blob/b72a05edb41a97017e9bd7877edcd2014a8d99d9/common/src/main/java/io/netty/util/concurrent/FastThreadLocal.java

Sounds like they found the same thing you did regarding ThreadLocal's use of a HashMap.

@akarnokd (Member, Author)

It costs about 10-15 cycles (about 5 ns per item on an i7 920, 2 ns on an i7 4770), which seems small, but if the actual operation is fast, it can noticeably slow things down. That's why I abandoned the approach back then: it lowered the perf numbers.

I've tried benchmarking the difference between a ThreadLocal and a cast to the custom thread, but I don't fully trust the results: ThreadLocal: 3.3 ns, Thread + cast: 0.8 ns.

import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

public class ThreadTypeCost {
    // custom thread type: a plain field replaces the ThreadLocal entry
    static final class MyThread extends Thread {
        public int flag = 1;
        public MyThread(Runnable r) {
            super(r);
        }
    }

    static int f;

    public static void main(String[] args) throws Exception {
        ExecutorService exec = Executors.newFixedThreadPool(1, MyThread::new);

        ThreadLocal<Boolean> tl = new ThreadLocal<Boolean>() {
            @Override
            public Boolean initialValue() {
                return true;
            }
        };

        int k = 10_000_000;
        long n = System.nanoTime();
        exec.submit(() -> {
            for (int i = 0; i < k; i++) {
                // variant 1: identify the thread via instanceof + cast
//              Thread t = Thread.currentThread();
//              if (t instanceof MyThread) {
//                  MyThread mt = (MyThread) t;
//                  f += mt.flag;
//              }
                // variant 2: ThreadLocal lookup
                if (tl.get()) {
                    f++;
                }
            }
        });
        exec.shutdown();
        exec.awaitTermination(1, TimeUnit.HOURS);
        // average nanoseconds per iteration (includes submit/startup overhead)
        System.out.println(1d * (System.nanoTime() - n) / k);
    }
}

@benjchristensen (Member)

I think we should use JMH for this. I've read enough about why JMH was created and the problems with using nanoTime directly to mistrust basically all benchmarks now :-)
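A JMH version of the comparison might look roughly like this; a sketch assuming the standard jmh-core annotations. Note that JMH's own worker threads are not MyThread, so this only measures the false branch of the instanceof variant; measuring the true branch would need a custom thread factory.

import java.util.concurrent.TimeUnit;

import org.openjdk.jmh.annotations.Benchmark;
import org.openjdk.jmh.annotations.BenchmarkMode;
import org.openjdk.jmh.annotations.Mode;
import org.openjdk.jmh.annotations.OutputTimeUnit;
import org.openjdk.jmh.annotations.Scope;
import org.openjdk.jmh.annotations.Setup;
import org.openjdk.jmh.annotations.State;

@BenchmarkMode(Mode.AverageTime)
@OutputTimeUnit(TimeUnit.NANOSECONDS)
@State(Scope.Thread)
public class ThreadCheckBenchmark {
    static final class MyThread extends Thread {
        MyThread(Runnable r) {
            super(r);
        }
    }

    ThreadLocal<Boolean> tl;

    @Setup
    public void setup() {
        tl = ThreadLocal.withInitial(() -> Boolean.TRUE);
    }

    @Benchmark
    public boolean threadLocalGet() {
        // returning the value prevents dead-code elimination
        return tl.get();
    }

    @Benchmark
    public boolean instanceofCheck() {
        return Thread.currentThread() instanceof MyThread;
    }
}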

@benjchristensen (Member)

Once we agree upon #1824, let's mark this as @Beta and merge it so we can start using it, then improve as needed.

@akarnokd (Member, Author)

Okay.

@benjchristensen (Member)

I added @Experimental to your commits in https://github.com/ReactiveX/RxJava/pull/1907/files

@DavidMGross (Collaborator)

Does this diagram (https://raw.githubusercontent.com/wiki/ReactiveX/RxJava/images/rx-operators/bp.obp.block.png) correctly illustrate the behavior of this operator?

@akarnokd (Member, Author)

akarnokd commented Dec 2, 2014

I'm not sure.

These patterns are non-blocking:
src: 1 | request(1) | out: 1
src: 1 | src: 2 | request(1) | out: 1 | request(1) | out: 2
src: 1 | src: 2 | request(1) | src: 3 | out: 1 | request(1) | out: 2 | request(1) | out: 3
request(128) | src: 1 | out: 1 | src: 2 | out: 2 | src: 3 | out: 3

These patterns are blocking:
src: 1 | src: 2 | src: 3 blocked | request(1) | src: 3 unblocked & out: 1 | request(2) | out: 2 | out: 3
request(128) | src: 1 | src: 2 | src: 3 blocked | src: 3 unblocked & out: 1 | out: 2 | out: 3

The latter may happen if the consumer runs on a different thread and drains slowly.

So if the consumer is slow or doesn't give enough permits, the queue fills up and the next over-capacity enqueue attempt blocks. At that point, only the consumer, running on another thread, can unblock the producer by requesting more and draining the queue until it becomes empty or the permits run out.
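A minimal sketch of that blocking-enqueue behavior, using a bounded queue of capacity 2 and a semaphore standing in for request() permits; this is illustrative only, not the PR's implementation, and it reproduces the first blocking pattern above:

import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.Semaphore;

public class BlockingBackpressureSketch {
    public static void main(String[] args) throws InterruptedException {
        BlockingQueue<Integer> queue = new ArrayBlockingQueue<>(2);
        Semaphore requests = new Semaphore(0); // outstanding request(n) permits

        Thread producer = new Thread(() -> {
            try {
                for (int i = 1; i <= 3; i++) {
                    queue.put(i); // blocks when the queue is full: "src: 3 blocked"
                    System.out.println("src: " + i);
                }
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });

        Thread consumer = new Thread(() -> {
            try {
                for (int received = 0; received < 3; received++) {
                    requests.acquire();   // wait for a permit from request(n)
                    int v = queue.take(); // draining unblocks the producer
                    System.out.println("out: " + v);
                }
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });

        producer.start();
        consumer.start();

        Thread.sleep(100);
        requests.release(1); // request(1)
        Thread.sleep(100);
        requests.release(2); // request(2)

        producer.join();
        consumer.join();
    }
}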

akarnokd deleted the OnBackpressureBlock branch May 6, 2015