deep convolution image software does resizing

piggsypiggsy Registered Users Posts: 88 Big grins
edited August 2, 2016 in Digital Darkroom
Just wanted to share this neat toy -

http://waifu2x.udp.jp/

it's kind of along the same lines as neural styles, deep dream, prisma, etc, which you may have heard of before. Only rather than repeatedly trying to match images to an artistic style over many iterations, this one tries to draw clean lines and (optionally) de-noise an image.

Probably best to use their own example image to show what it does -

ed6x8lr.jpg

as in its suggested use case it does a fantastic job of resizing compressed images of drawn art with lots of bold colours and strong lines. And it's really designed to hold up just for a goof image where you you have a bit of dvd or blu ray captured and want to make it visible at non-ant sizes. But if you start feeding it lossless images (or better, lossless unsharpened images) and keep on scaling, it can do some remarkable things.

Some examples -

100% - original image (1920x1200)
dejesus100-1xfusl2.png

100% - 8x scaled image (15360x9600)
http://abload.de/img/dejesus100-8xlysgc.png

100% - original image (1151x1600)
gorobo-100-1xw0uu1.png

100% - 8x scaled image (9208x12800)
http://abload.de/img/gorobo-100-8xvfurs.png

If you manage to find uncompressed sources of an image made of clearly defined lines you're pretty much only limited by memory requirements (much past ~10k x 15k pics do my 4GB 970GTX/16GB RAM box in). Although if you manage to de-jpg well enough to begin with you can push a jpg very far also, at least, probably far enough for most non-billboard purposes.

Here's how it handles a regular image of mine -
p4100124thpfsm7.jpg

feed it the TIFF source (3411x2625)

100% - source

clipboard02sbz46.jpg

100% - 4x scaled output (13644x10500)

https://abload.de/img/clipboard01yvzgz.jpg

That's the simplest possible scaling too, without going in and doing anything differently noise or sharpening wise, just, asking it to blow it up. That's pretty neat.

Anyway, for most people who aren't prepared to install a linux, there's a web portal version of it at the link with a small-ish file output limit, and if you hit their github, you'll find a windows executable version of it with an installable GUI and right click "send to" menu support (look in waifu2x-converter_x64_1130\ExtendedSendTo) here:

https://github.com/tanakamura/waifu2x-converter-cpp

ed- just to add, this is one of the more lightweight neural network photography things going - even if you don't have a CUDA or OpenCL/GCN supported GPU, you can still make very big images in reasonable timeframes on the CPU only. About 15-40 minutes depending on the source and output on my Haswell/DC I5 at 4.3ghz, vs about 0.5/1 minute on the 970.

Comments

Sign In or Register to comment.