The NSA was tasked by Congress to monitor the entire internet so they “did”. They were overwhelmed with data and were happy to shut it down and go back to something more manageable when there was backlash. They don’t have genius hackers coming up with creative solutions to every problem, they have enterprise software and a bureaucracy and a mandate to track certain groups which in the time of their “monitor the whole internet” were religious terrorists and 2nd world hackers. They were busy with their job.
Data is expensive. It wasn’t permanently stored, they don’t have AI that just knows how to sort it. The new useful AI today doesn’t sort it, it takes it to generate new data which is useless for this purpose.
The difference between being targeted and known and being automatically known when caught in a dragnet of data is everything. If the FBI wants to know you, subpoenas your ISP and gets access to your accounts, they’ll know you. They will need at least an agent reviewing your information. The FBI is a little busy and took quite a while tracking down people whose faces were in videos in the capitol. This is not an unlimited org. Neither is the NSA with a backlog of military and espionage concerns.
Facebook and Google have the best AI driven revenue technology. It uses targeted advertising. Google had my profile very wrong ten years ago. Facebook, which has direct info, finally figured out things to sell me in 2022 for a little while. Mostly ads for things I owned or adjacent products I would never want, but it fed me sales on a few products I wanted. Made sense I was buying a ton of those products on the browser profile with my Facebook. Since not buying more, they no longer can figure out what I buy. They advertise a lot in Spanish, which is cute but backward. AI at its best can anticipate genre but never quality.
A great example of that is Spotify which has one of the best recommendation algorithms. It’s a low bar. If you don’t work with its confines, it gives you terrible recommendations. You have to save way too many songs to your library before it even gets close to knowing, and even then, it has no idea you might be discerning on axes of quality, which as far as it is algorithmically concerned, is a black box of an arbitrary subjective mess. And it is, except that it totally isn’t. Believe me I have talked to gpt4 about it, the existing literature cannot synthesize an understanding of that quasi objective quality that people “get” when they are awake to it. But Spotify also completely fails to deep learn or spontaneously match people on a “I only want the best in xyz genres” with the others like that. No, it is a highly parameterized monstrosity ultimately stuck on genre, limited to the creative efforts of its team, mostly crystallized some years back.
Reddit has an incredible sophisticated recommendation system. It too has to same fundamental design flaw and will eventually just give you whatever thing you looked at one time over and over again to the detriment of things you actually and regularly want to read. Often times a fresh browser Reddit gives me better content for me than what I log in and see.
These companies, and especially google and Facebook, have funded much of the AI research and have the clearest channels and biggest incentives to use it, and they are mid at best. They’ve had me on the same accounts for almost 20 years and Facebook alone could barely advertise to me for a few good months.
There’s a bench of a girls memorial up the hill from me. She died I think 2011, born maybe 1988. Everything was online by 2011, especially deaths and formal things like county benches. She doesn’t even exist. Some people aggregators have her “living” at a couple different last addresses, no information. No obituary and for sure there was one, her family got her a stoner bench in a choice spot ffs! Not online.
AI and the hardware has never been better, and it’s all still trash except for tools here and there like using chat gpt to help you with a topic that is well documented like coding. Still requires a coder to make it work though, it’s laden with hallucinations. The difference between putting agents on investigating a person and getting meaningful mass data is a world apart, and has not been solved. Storage is expensive and while a few companies are trying to save what they can, most are trying to dump that information as fast as possible.