Currently, `ggml` forces the user to allocate the compute graphs on the stack. The `ggml` API should be extended to support using heap allocated graphs.