error-chain Memory bloat

trafficstars

The code (basing on the example from the readme)

#![recursion_limit = "1024"]
#[macro_use]
extern crate error_chain;

mod errors {
    error_chain! { }
}

fn main () {
    use errors::Result as ChainResult;
    use std::mem::size_of;
    println!("size of vanilla result {}", size_of::<Result<(), ()>>());
    println!("size of error_chain result {}", size_of::<ChainResult<()>>());
}

outputs:

size of vanilla result 1
size of error_chain result 56

I might be wrong, but isn't ChainResult just "yet another enum", which means those 55 extra bytes have to be copied over when returning, even in the success event? That would mean quite a performance overhead over normal Result in hot code, especially where only the Ok case which may be minimal is encountered, no?

Maybe the situation can be improved by making the backtrace functionality optional, as in possible to opt out of at compile time?

Sep 16 '16 01:09 est31

@kali has made a very interesting overview over the origin of the 56 bytes: https://github.com/brson/error-chain/pull/45#issuecomment-249410945

Oct 01 '16 20:10 est31

This doesn't seem like a great comparison - Result<(), ()> carries no data aside from the discriminant. No one is using Result<(), ()> (if they are, they should probably be using bool or a new enum). How does an error_chain generated result compare to say io::Result, carrying the same information?

Oct 01 '16 23:10 withoutboats

if they are, they should probably be using bool or a new enum

Result<(), ()> is very useful as it can be used together with the try macro, and it carries the information for the reader with it that the operation is failable. Bool can mean anything, you have to read the docs to find out what it means. Documenting your API over the type system is one of the things Rust is about. Also, it gives the user a warning that it needs to be used, unlike bool.

Regardless of whether Result<(),()> is considered something to avoid or not, which can be surely discussed about, I only wanted to measure what the additional overhead was of the error-chain error.

And about io::Result: io::Result<()> uses 24 bytes, which is still less than half of what the error-chain Result is carrying.

Oct 02 '16 10:10 est31

Regardless of whether Result<(),()> is considered something to avoid or not, which can be surely discussed about, I only wanted to measure what the additional overhead was of the error-chain error.

OK but you've used a type which has 0 bytes of data for the non-discriminant portion. All you've documented is that the largest variant of the null error chain is 55 bytes. That doesn't mean that the equivalent of io::Result is 79 bytes, it might still be 55 bytes large.

Oct 02 '16 10:10 withoutboats

Not really an issue, closing.

Nov 17 '16 03:11 Yamakaky

This is a pretty major issue of this crate, as its against the "performance speed ease of use pick three" motto of Rust, by improving ease of use by heavily impairing performance.

Nov 17 '16 09:11 est31

The solution would be to make the Msg variant optional. Any idea about the macro syntax?

Nov 19 '16 07:11 Yamakaky

https://github.com/brson/error-chain/commit/2d34f22fb90f2d3159c0bd2eaa890ac6130a4096

Nov 19 '16 08:11 Yamakaky

If we assume that errors are less likely than the happy path, we can optimize the memory footprint for the happy path if we heap-allocate most of the error payload.

If we look at Result<T, E>, then T might be as small as (). So it might be desirable to get E as small as a pointer, i.e. boxing everything E = Box<SomeError> without erasing the type information (i.e. no trait object). In situations where errors are unlikely, the error-path may be expensive as long as the happy-path is optimized.

But on the other hand there may be situations where such heap allocations are not desirable, e.g. (i) when errors happen frequently or (ii) when compiling with #[no_std] and heap allocations are not possible or (iii) error handling in oom situations.

So maybe error_chain could offer different memory layouts which implement the same interfaces and are therefore completely exchangeable (when looked at from the outside).

Nov 22 '16 22:11 colin-kiegel

Hum, interesting. Any idea for a the syntax?

Nov 22 '16 22:11 Yamakaky

I think the first question is: do you want this behavior to be controlled by syntax in the macro invocation, or by a crate feature? A crate feature would be easier to implement & IMO probably easier to use (less new syntax to learn), but wouldn't allow you to scope the behavior to an individual invocation. I'm not sure how often people are defining multiple error_chains in their crate, much less they have a need to different performance properties between them.

Nov 22 '16 23:11 withoutboats

A crate feature would be enough for me, too. Anything that makes the memory footprint smaller :)

Nov 22 '16 23:11 est31

I think you could start with a crate feature. That should be sufficient in most cases. A syntax could still be introduced later if there is demand and the crate feature would then only set the default behaviour.

Nov 22 '16 23:11 colin-kiegel

Someone correct me if we don't guarantee this yet, but I think if you box the error, Result<()>s will be subject to the NPO, and be 1 pointer in size. The Ok case will be represented by a null pointer. This seems pretty :+1:.

Nov 22 '16 23:11 withoutboats

I started the implementation with the feature, but I get errors like Box<Error>: std::convert::From<std::io::Error> not satisfied. I guess I have to use a tuple struct?

Nov 22 '16 23:11 Yamakaky

@Yamakaky Have you tried implementing From<E: std::error::Error> for Box<Error>? Box is exempted from some of the normal orphan rules and so it might work. (Seems to work in playpen: https://is.gd/fjbjZS)

Edit: Looking at all the impls for the error type, you probably will need a tuple struct, because some of them like Deref for example will conflict with std impls.

Nov 22 '16 23:11 withoutboats

Oh, cool!

Nov 22 '16 23:11 Yamakaky

Please review https://github.com/brson/error-chain/pull/69

Nov 22 '16 23:11 Yamakaky

Result<()>: 8
  (): 0
  Error: 8

Nov 23 '16 20:11 Yamakaky

Awesome!

Nov 23 '16 20:11 est31

Is there still work to be done on this issue?

Nov 04 '17 03:11 cgm616

Was this feature implemented? If not, why it was abandoned?

Jan 25 '18 07:01 willir

Honestly, I don't really work on a Rust project anymore, so I haven't been working on error-chain for some time...

Jan 25 '18 19:01 Yamakaky

error-chain error-chain copied to clipboard

Memory bloat

error-chain
error-chain copied to clipboard