(I work at OpenAI.)

It's really how it works.


> We’re moving toward a world where every job will be modeled

After an OpenAI launch, I think it's important to take one's feelings about the future impact of the technology with a HUGE grain of salt. OpenAI are masters of hype. They have been generating hype for years now, yet the real-world impacts remain modest so far.

Do you remember when they teased GPT-2 as "too dangerous" for public access? I do. Yet we now have Llama 3 in the wild, which even at the smaller 8B size is about as powerful as the [edit: 6/13/23] GPT-4 release.

As someone pointed out elsewhere in the comments, a logistic curve looks exponential in the beginning, before it approaches saturation. And logistic curves are far more common than true exponentials, especially in ML. I think it's interesting that GPT-4o doesn't show much of an improvement in "reasoning" strength.
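
To make that concrete, here is a minimal numerical sketch in Python (arbitrary parameters, purely illustrative, not a model of any real capability trend):

    import math

    # A logistic curve 1 / (1 + e^-(t - t0)) tracks the pure exponential
    # e^(t - t0) almost exactly while t is well below the midpoint t0;
    # it only bends away as it approaches saturation.
    t0 = 10
    for t in range(0, 21, 4):
        logistic = 1 / (1 + math.exp(-(t - t0)))
        exponential = math.exp(t - t0)
        print(f"t={t:2d}  logistic={logistic:.4f}  exp={exponential:.4f}")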


The amount of negativity in these comments is astounding. Congrats to the teams at Google on what they have built, and hoping for more competition and progress in this space.

We've had voice input and voice output with computers for a long time, but it's never felt like spoken conversation. At best it's a series of separate voice notes. It feels more like texting than talking.

These demos show people talking to artificial intelligence. This is new. Humans are more partial to talking than writing. When people talk to each other (in person or over low-latency audio) there's a rich metadata channel of tone and timing, subtext, implicit knowledge. These videos seem to show the AI using this kind of metadata, in both input and output, and the conversation even flows reasonably well at times. I think this changes things a lot.


I think the modern American supermarket would blow the minds of anyone born before 1900 more than any other marvel that exists.

You have blueberries for sale in January??? A variety box of tea from 7 different countries? A wall of spices? Pineapples? Packaging made from aluminum that is just thrown away? The bread isn't full of sand and grit? And it's sliced!!!

All relatively affordable and accessible to the average person.


The most impressive part is that the voice uses the right emotions and tonal language during the presentation. I'm not sure how much of that is because they had tested this over and over, but it is really hard to get that right, so if they didn't fake it in some way, I'd say that is revolutionary.

The usual critics will quickly point out that LLMs like GPT-4o still have a lot of failure modes and suffer from issues that remain unresolved. They will point out that we're reaping diminishing returns from Transformers. They will question the absence of a "GPT-5" model. And so on -- blah, blah, blah, stochastic parrots, blah, blah, blah.

Ignore the critics. Watch the demos. Play with it.

This stuff feels magical. Magical. It makes the movie "Her" look like it's no longer in the realm of science fiction but in the realm of incremental product development. HAL's unemotional monotone in Kubrick's "2001: A Space Odyssey" feels... oddly primitive by comparison. I'm impressed at how well this works.

Well-deserved congratulations to everyone at OpenAI!


> (I work at OpenAI.)

Winner of the 'understatement of the week' award (and it's only Monday).

Also top contender in the 'technically correct' category.


> This stuff feels magical. Magical.

Because its capabilities are focused on exactly the right place to feel magical. Which isn't to say that there isn't real utility, but language (written, and even more so spoken) has an enormous emotional resonance for humans, so this is laser-targeted in an area where every advance is going to "feel magical" whether or not it moves the needle much on practical utility. It's not unlike the effect of TV news making you feel informed, even though time spent watching it correlates negatively with understanding of current events.


Interesting, both Karpathy and Sutskever are gone from OpenAI now. Looks like it is now the Sam Altman and Greg Brockman show.

I have to admit, of the four, Karpathy and Sutskever were the two I was most impressed with. I hope they go on to do something great.


The license is not good: https://falconllm-staging.tii.ae/falcon-2-terms-and-conditio...

It's a modified Apache 2 license with extra clauses that include a requirement to abide by their acceptable use policy, hosted here: https://falconllm-staging.tii.ae/falcon-2-acceptable-use-pol...

But... that modified Apache 2 license says the following:

"The Acceptable Use Policy may be updated from time to time. You should monitor the web address at which the Acceptable Use Policy is hosted to ensure that your use of the Work or any Derivative Work complies with the updated Acceptable Use Policy."

So no matter what you think of their current AUP they reserve the right to update it to anything they like in the future, and you'll have to abide by the new one!

Great example of why I don't like the trend of calling licenses like this "open source" when they aren't compatible with the OSI definition.


This is a very cool demo. If you dig deeper, there's a clip of a "blind" AI (no camera) talking to another AI that has live camera input, asking it to explain what it's seeing. Then, together, they sing a song about what they're looking at, alternating each line and rhyming with one another. Given all of the isolated capabilities of AI, this isn't particularly surprising, but seeing it all work together in real time is pretty incredible.

But it’s not scary. It’s… marvelous, cringey, uncomfortable, awe-inspiring. What’s scary is not what AI can currently do, but what we expect from it. Can it do math yet? Can it play chess? Can it write entire apps from scratch? Can it just do my entire job for me?

We’re moving toward a world where every job will be modeled, and you'll either be an AI owner, a model architect, an agent/hardware engineer, a technician, or just... training data.


Every e-ink controller sucks. This person took it upon themselves to fix that, and released the result, which is now the state of the art, as open source hardware.

I love people and projects like this.


There have been 3 updates to the zones in the past 50 years. Some of the updates are due to better accuracy after years of collecting data, but the elephant in the room is climate change. Where I live, winters are 4.5 degrees warmer. It has definitely affected my gardening.

In the aughts I worked at Adobe and spent time trying to archive the source code for Photoshop, Illustrator, PostScript, and other apps. Thomas Knoll's original Mac floppy disk backups were available, so I brought in my Mac Plus, with a serial cable to transfer the files to a laptop via Kermit. The first version was 0.54, dated 6 July 1988. The files on the floppies were in various ancient compressed archive formats, but most were readable. I created an archive on a special Perforce server of all the code that I found. Sadly, the earliest Illustrator backups were on a single external disk drive that had gone bad.

It’s like $100 per board now once you add a power supply and a case, and more if you also add storage. The cheapest Intel system on Amazon is $139. The whole point of the thing was affordability, and that was kind of lost along the way.

I found these videos quite hard to watch. There is a level of cringe that I found a bit unpleasant.

It’s like some kind of uncanny valley of human interaction that I don’t get on nearly the same level with the text version.


Accessibility is for everyone, including you, if you live long enough. And the alternative is worse. So your choice is death or you are going to use accessibility features. – Siracusa

I skimmed through the article but didn’t find mention of one glaring deficiency in iPadOS — it still doesn’t support multiple users and multiuser switching, even though the hardware is capable of it (and exceeds the capacity of many Macs before it). I decided several years ago that I’m not buying another iPad until this is sorted out by iPadOS.

I think of iPhones as personal devices, where each person may have their own. But iPads are more likely to be shared for personal use in families. The fact that each person using it cannot have their own user profiles, app data, etc., is a huge drawback. Apple has supported this for a long time (though probably not in the best way) for education, but it’s not available to others. Even tvOS supports switching between user profiles quickly.

Apple enforcing the idea that iPad (with iPadOS) should also be a personal device — one device per person — makes the user experience quite poor.


The system works! Just raise your concerns and they'll get around to it in [checks notes] 18 years

https://twitter.com/cperciva/status/1785402732976992417


Everyone replying with "what's the big deal?" is showing their tech privilege. You may not have to deal with intrusive monitoring, but warehouse workers are increasingly being made to wear ankle bracelets so every movement of theirs can be monitored and stack ranked. Workers in WFH "gig" jobs are made to install always-on keyloggers and other monitoring software on their personal computers and phones (which are required for the job). Companies take photos/videos of them in their homes every few minutes throughout the day. Plenty of jobs require you to hand your social media passwords to your employer. There is an entire class of companies that specialize in all of this.

Not everyone is able to say "no" to all this and still make rent next month. I'm happy the government is finally stepping in.


> For any model that will be used broadly across all of our customers, we do not build or train these models in such a way that they could learn, memorise, or be able to reproduce some part of Customer Data

This feels so full of subtle qualifiers and weasel words that it generates far more distrust than trust.

It only refers to models used "broadly across all" customers - so if it's (a) not used "broadly" or (b) only used for some subset of customers, the whole statement doesn't apply. Which actually sounds really bad because the logical implication is that data CAN leak outside those circumstances.

They need to reword this. Whoever wrote it is a liability.


Narrator: A new car built by my company leaves somewhere traveling at 60 mph. The rear differential locks up. The car crashes and burns with everyone trapped inside. Now, should we initiate a recall? Take the number of vehicles in the field, A, multiply by the probable rate of failure, B, multiply by the average out-of-court settlement, C. A times B times C equals X. If X is less than the cost of a recall, we don't do one.

Business woman on plane: Are there a lot of these kinds of accidents?

Narrator: You wouldn't believe.

Business woman on plane: Which car company do you work for?

Narrator: A major one.


A few months ago there were articles going around about how Samsung Galaxy phones were upscaling images of the Moon using AI [0]. Essentially, the model was artificially adding landmarks and details based on its training set when the real image quality was too poor to make out details.

Needless to say, AI upscaling as described in this article would be a nightmare for radiologists. 90% of radiology is confirming the absence of disease when image quality is high, and asking for complementary studies when image quality is low. With AI enhanced images that look "normal", how can the radiologist ever say "I can confirm there is no brain bleed" when the computer might be incorrectly adding "normal" details when compensating for poor image quality?

[0] - https://news.ycombinator.com/item?id=35136167


A Google search for practically any long-tail keyword will reveal that LLMs have already had a very significant impact. DuckDuckGo has suffered even more. Social media is absolutely lousy with AI-powered fraud of varying degrees of sophistication.

It's glib to dismiss safety concerns because we haven't all turned into paperclips yet. LLMs and image gen models are having real effects now.

We're already at a point where AI can generate text and images that will fool a lot of people a lot of the time. For every college-educated young person smugly pointing out that they aren't fooled by an image with six-fingered hands, there are far more people who had marginal media literacy to begin with and are now almost defenceless against a tidal wave of hyper-scalable deception.

We're already at a point where we're counselling elders to ignore late-night messages from people claiming to be a relative in need of an urgent wire transfer. What defences do we have when an LLM can hold a completely fluent, natural-sounding conversation in someone else's voice? I'm not confident that I'd be able to distinguish GPT-4o from a human speaker in the best of circumstances, and I'm almost certain that I could be fooled if I'm hurried, distracted, sleep-deprived or otherwise impaired.

Regardless of any future impacts on the labour market or any hypothesised X-risks, I think we should be very worried about the immediate risks to trust and social cohesion. An awful lot of people are turning into paranoid weirdos at the moment and I don't particularly blame them, but I can see things getting seriously ugly if we can't abate that trend.


Remember: if VCs believed in what they were doing they would not take a 2% annual management fee and 20% of the upside.

They’d take 40% of the upside and live on ramen noodles.
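
To put rough numbers on "2 and 20", here is a back-of-the-envelope Python sketch (fund size, fund life, and outcome are all made up, purely illustrative):

    fund_size = 100_000_000                  # a $100M fund, made-up number
    years = 10                               # typical-ish fund life
    mgmt_fees = 0.02 * fund_size * years     # 2% per year, paid win or lose
    print(f"Fees regardless of outcome: ${mgmt_fees:,.0f}")    # $20,000,000

    exit_value = 300_000_000                 # suppose the fund returns 3x
    carry = 0.20 * (exit_value - fund_size)  # 20% of profit above capital
    print(f"Carry on a 3x fund:        ${carry:,.0f}")         # $40,000,000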

VCs make money by raising money from LPs.

They spend this money on investments which don’t look too bad if they fail, because nearly all of them fail. Looking good while losing all of your investors' money on companies which go broke is the key VC skill.

Once in a while you get a huge hit. That’s a lottery win; there is no formula for finding that hit. Broad bets help, but that’s about it. The “VC thesis” is a fundraising tool, a pitch instrument; it makes no measurable difference to success. It’s a shtick.

Sympathy, however, for the VC: car-dealership-sized transactions paired with the diligence burdens of real finance. It’s a terrible job.

Once you understand that VC is one of the worst jobs in finance, and that they don’t believe most of their own story (it’s fundraising flimflam for their LPs), it’s a lot easier to negotiate. The pitch you make back to them is:

1) we are a sound bet not to get you in trouble if we fail (good schools and track records)

2) we will work hard on things which your LPs and their lawyers understand, leaving evidence of a good effort on failure

3) we know how the game works and will play by the unwritten rules: keep up appearances

The kind of lunatics who actually stand to make money with a higher probability than average, the “Think Different” category, usually violate all of these rules:

1) they have no track record

2) they work on esoteric nonsense

3) they look weird in public

And they’re structurally uninvestable.

Once you get this, it’s all a lot easier: the job of a VC is not to invest in winners; that’s a bonus.

The job of a VC is to look respectable while losing other people’s money at the roulette wheel, and taking a margin for doing so.

I hope that helps.


I love accessibility features because they might be the last features developed solely with the benefit of the user in mind. So many other app/os features are designed to steal your attention or gradually nerf usefulness.

Years ago, over a decade ago now, I was a .NET developer. Microsoft introduced Entity Framework, their new way of handling data in .NET applications. Promises made, promises believed, we all used it. I was especially glad of lazy loading, where I didn't have to load data from the database into my memory structures; the system would do that automatically. I could write my code as if all my memory structures were populated and not worry about it. Except it didn't work consistently. Every now and again a memory structure would not be populated, for no apparent reason. Digging deep into TechNet, I found a small note saying "if this happens, then you can check whether the data has been loaded by checking the value of this flag and manually loading it if necessary" [0]. So, in other words, I had to manually load all my data because I couldn't trust EF to do it for me. [1]
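
The shape of that workaround, as a minimal Python sketch with hypothetical names (the real Entity Framework API is C#; this only shows the pattern of checking an "is loaded" flag and loading manually):

    # Hypothetical sketch: check a "loaded" flag and populate the data
    # yourself, because the lazy loader can't be trusted to do it.
    class Customer:
        def __init__(self):
            self.orders = None
            self.orders_loaded = False

    def load_orders_from_db(customer):
        # Stand-in for a real database query.
        customer.orders = ["order-1", "order-2"]
        customer.orders_loaded = True

    def get_orders(customer):
        if not customer.orders_loaded:   # lazy load may silently not have run
            load_orders_from_db(customer)
        return customer.orders

    print(get_orders(Customer()))        # ['order-1', 'order-2']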

Long analogy short, this is where I think AI for coding is now. It gets things wrong enough that I have to manually check everything it does and correct it, to the point where I might as well just do it myself in the first place. This might not always be the case, but that's where I feel it is right now.

[0] Entity Framework has moved on a lot since then, and apparently now can be trusted to lazily load data. I don't know because...

[1] I spat the dummy, replaced Windows with Linux, and started learning Go. Which does exactly what it says it does, with no magic. Exactly what I needed, and I still love Go for this.


I'm ceaselessly amazed at people's capacity for impatience. I mean, when GPT-4 came out, I was like "holy f, this is magic!!" How quickly we get used to that magic and demand more.

Especially since this demo is extremely impressive given the voice capabilities, yet still the reaction is, essentially, "But what about AGI??!!" Seriously, take a breather. Never before in my entire career have I seen technology advance at such a breakneck speed - don't forget transformers were only invented 7 years ago. So yes, there will be some ups and downs, but I couldn't help but laugh at the thought that "14 months" is seen as a long time...


> Email addresses published on webpages usually need to be protected from email-harvesting spambots.

Do they though?

I have had my email address published on my website in a <a href="mailto:… link for like 20 years, and I don't get spam that makes it through the spam filter.

I use both Gmail and (for some other addresses) a webmail hosted by a local company which uses some other filter. Both work well, so it's not something only Google can do.

