Adrià
Oops, I completely missed that. Sorry for the noise, then
I was thinking more about a version that parses the network directly with a visitor that, for each layer, maps it to the corresponding ONNX operation, specifies the inputs/outputs...
But visitors are fun, all that SFINAE feels almost like pattern matching in functional languages, haha!
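A minimal sketch of that visitor idea, just to make it concrete. Everything here is an assumption for illustration: `fc_layer`, `relu_layer`, `con_layer`, `has_weights`, `onnx_node` and `onnx_visitor` are hypothetical stand-ins, not dlib types and not a real ONNX exporter. Only the operator names (`Gemm`, `Relu`, `Conv`) come from ONNX. The SFINAE part is the pair of `operator()` overloads selected by a trait.

```cpp
#include <cstddef>
#include <string>
#include <type_traits>
#include <vector>

// Hypothetical stand-ins for network layers (NOT dlib types).
struct fc_layer   { int num_outputs; };
struct relu_layer {};
struct con_layer  { int num_filters; };

// Hypothetical trait: true only for layers that carry learnable weights.
template <typename T> struct has_weights : std::false_type {};
template <> struct has_weights<fc_layer>  : std::true_type {};
template <> struct has_weights<con_layer> : std::true_type {};

// Simplified description of one graph node (assumed structure).
struct onnx_node {
    std::string op_type;
    std::vector<std::string> inputs;
    std::string output;
};

// Map each layer type to the name of an ONNX operator.
inline const char* onnx_op(const fc_layer&)   { return "Gemm"; }
inline const char* onnx_op(const relu_layer&) { return "Relu"; }
inline const char* onnx_op(const con_layer&)  { return "Conv"; }

class onnx_visitor {
public:
    // Selected via SFINAE for layers with weights: adds a weight input.
    template <typename Layer>
    typename std::enable_if<has_weights<Layer>::value>::type
    operator()(std::size_t idx, const Layer& l) { add(idx, l, true); }

    // Selected via SFINAE for layers without weights.
    template <typename Layer>
    typename std::enable_if<!has_weights<Layer>::value>::type
    operator()(std::size_t idx, const Layer& l) { add(idx, l, false); }

    const std::vector<onnx_node>& nodes() const { return nodes_; }

private:
    template <typename Layer>
    void add(std::size_t idx, const Layer& l, bool weights) {
        onnx_node n;
        n.op_type = onnx_op(l);
        // Chain each node to the output of the previously visited one.
        n.inputs.push_back(nodes_.empty() ? "input" : nodes_.back().output);
        if (weights) n.inputs.push_back("w" + std::to_string(idx));
        n.output = "t" + std::to_string(idx);
        nodes_.push_back(n);
    }

    std::vector<onnx_node> nodes_;
};
```

Calling `v(0, con_layer{32}); v(1, relu_layer{}); v(2, fc_layer{10});` on an `onnx_visitor v` records three nodes in visiting order, each wired to the previous node's output. A real version would have to walk the actual network and handle its layer ordering and skip connections, which this sketch ignores.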
In my humble opinion, there are two key aspects unique to dlib in that network:
- the input layer: [input_rgb_image_pyramid](http://dlib.net/ml.html#input_rgb_image_pyramid) - this layer creates a tiled pyramid of the input...
It seems like it could be achieved with something like this: http://dlib.net/imaging.html#object_detector
Example programs:
- [C++](http://dlib.net/train_object_detector.cpp.html)
- [Python](http://dlib.net/train_object_detector.py.html)
Nice! Have you checked whether you can open animated GIF files? With `cv::VideoCapture` you can read them as if they were a video file. If that works, we will also...
I'm genuinely interested in this feature, but I don't think I'll have time soon to work on it, unfortunately :(
I want this so badly that I will try to make it work with CMake (never done that before, though) before it gets closed and falls into oblivion :P
I've added support for finding FFMPEG using PkgConfig in CMake. @pfeatherstone would you mind giving me write access to this PR?
Yes, I already pushed a PR: https://github.com/pfeatherstone/dlib/pull/1