IBM compiles dataset to teach software how software is made: 14m code samples, half of which actually work

Pete 2 Silver badge

correct, secure, fast - choose one.

> About half of the samples work as expected (hopefully the authors did not expect them to fail?)

Functionality is nice, but doing it securely is better. If this IBM data can be used to rewrite code so that it is hardened against hacks, then it might have some use.

And best of all is if the code can be made to work efficiently and without bloat.

