>$1M for the AI that can close 90% of new GitHub issues If your AI can do this, ...

andyk · 2024-12-16T21:07:30 1734383250

(reposting from locallama and lower down here) yep that's true.

one of my goals is to inspire and honor those that work on open source AI. Those people tend to be motivated by things like impact and the excitement of being part of something big. i know that's how i always feel when i'm around Berkeley and get to meet or work with OG BSD hackers or the people who helped invent core internet protocols.

those people are doing this kind of OSS work and sharing it with the world anyway, without any cash prize. i think of this as a sort of thank you gift for them. and also a way to maybe convince a few people to explore that path who might not have otherwise.

senko · 2024-12-16T17:33:09 1734370389

And Linux kernel, curl, SQLite and many other open source software are worth infinitely more than the purchase price.

Also, you cut off the "from the benchmark" part; this doesn't expect it to solve any random Github issue, just the ones from the (presumably manually vetted and cleaned up) bench dataset.

minimaxir · 2024-12-16T17:40:27 1734370827

Linux kernel, curl, and SQLite don't require significant compute cost to develop that put it out of reach of hobbyists, and only in the reach of organizations expecting a positive ROI.

senko · 2024-12-16T18:13:03 1734372783

The cost of Linux kernel development alone has been estimated at a few $B (https://dwheeler.com/essays/linux-kernel-cost.html), current figure is probably over tens of billions.

Also, the prize doesn't require you to train a new foundational model, just that whatever you use is open weights or open source.

Theoretically, might be get away with a Llama3.3 (or any other model which you think makes sense) with a cleverly designed agentic system and a fresh codebase-understanding approach, with minimal compute cost.

(ok, probably not that easy, but just saying there's much more to AI coding that the underlying model)

bruce511 · 2024-12-16T19:12:44 1734376364

>> The cost of Linux kernel development alone has been estimated at a few $B (https://dwheeler.com/essays/linux-kernel-cost.html), current figure is probably over tens of billions.

I followed your link, but it doesn't seem to bear out upur assertion. The two numbers mentioned in the article are 176 mil and 612 mil. Mind you those weren't an estimate of cost, but rather an estimate to replace. Article is dated 2004, with an update in 2011.

Using the lines-of-code estimation it crossed a billion in 2010 - again to replace. That has no relation to what it did actually cost.

Getting from there to "tens of billions" seems a stretch. Assuming a bottom value in your estimate of 20 billion, and assuming a developer costs a million a year, that's 20 000 man-years of effort. Which implies something like 2000 people (very well paid people) working continuously for the last decade.

Which seems, well, unlikely.

senko · 2024-12-16T20:09:01 1734379741

> The two numbers mentioned in the article are 176 mil and 612 mil.

Those two numbers are from the intro. The postscript and the updates at the end mention $1.4b and $3b respectively.

The real cost is probably impossible to calculate, but that order of magnitude is a reasonable estimate IMHO, and absolutely comparable, or even larger, than compute costs for SOTA LLMs

olddustytrail · 2024-12-16T19:28:44 1734377324

There are around 5000 active kernel devs, they are generally highly skilled and therefore highly paid, and they've been working for a lot longer than 10 years.

So doesn't seem that unlikely based on your estimates.

stevage · 2024-12-16T20:10:25 1734379825

Highly paid like a million a year? Is that a thing?

senko · 2024-12-16T20:43:24 1734381804

Linux kernel has been in development since the nineties, not just for the last ten years. Also 5000 contributors is a lot more than 2000 from gp's comment.

Let's ignore the years before dotcom boom since the dev community was probably much smaller, and assume an average of 3500 contributors since.

That's 25 years * 3500 contributors on average * 200k salary (total employee cost, not take home) = $17.5b

Napkin math, but order of magnitude checks out.

frgtpsswrdlame · 2024-12-16T22:26:52 1734388012

If you're the only one that can come close. Kaggle competition prizes are about focusing smart people on the same problem. But it's very rare for one team to blow all the others out of the water. So if you wanted to make a business out of the problem kaggle will (probably) show the best you could do and still have no moat.

toast0 · 2024-12-17T15:39:18 1734449958

Can't the stale bot already do this?

segmondy · 2024-12-16T17:00:00 1734368400

Exactly, I'll personally buy it for $2Million for anyone that can get it and assign me the full code/weight and rights.

andyk · 2024-12-16T21:27:28 1734384448

I hope the competition will inspire people to make breakthroughs in the open, so I won't take any rights to the IP, instead the winning solutions must use open source code and open weight models.

noch · 2024-12-16T17:27:55 1734370075

> I'll personally buy it for $2Million for anyone that can get it and assign me the full code/weight and rights.

If you are serious, you should put the funds in an escrow contract and announce a bounty.

There are many brilliant people who would work on this for you.

thrance · 2024-12-17T01:51:07 1734400267

It's 90% of a selection of new GitHub issues, we don't know about the complexity of these. I don't think they'd ask the AI for a giant refactoring of the codebase, for example.