Komunitas
lemmy.world
The Gemini podcast is going to condense your text and make it conversational, but it will necessarily lose detail in the process. A better recommendation is the Eleven Labs Reader, it’ll just read any text or file you throw at it with top tier voice models. Can use it for free and they have paid plans for more use. They also have a “podcast” generator option like Gemini, but I haven’t tried it so can’t vouch for the quality. I use Eleven Labs all the time for things I want to read, like email newsletters, industry publications, etc but never find the time to sit down and read. Now I can have AI read them to me while I walk the dog. Super handy imo
Komunitas
lemmy.dbzer0.com
So, what this seems to be saying, and what is reflected in the settings screens I can access on devices I have at hand, is that Gemini will still have (limited) functionality hooking jnto certain apps even if you’ve disabled “Gemini App history”. ~~The app is being pushed through normal updates via Google Play (by your carrier or Google itself). In some cases the app can be uninstalled by the end user through the normal UI, and in other cases it installs as a system app and requires adb to be used to disable and/or uninstall it.~~ EDIT: FALSE, GOOGLE HAS NOT PUSHED THE APP OUT YET, JUST A SETTINGS PAGE As always, the most secure way to use android is through a custom locked down ROM like Graphene OS.
Komunitas
jlai.lu
[ iA : GPT-4o sur macOS et Gemini + comparatif entre les 2 et Copilot ] [ app GPT-4o sur macOS ] Bonjour le Jeudi Tech, OpenAi a annoncé lundi que GPT-4o serait dispo sur desktop (macOS) La page openai.com/chatgpt/download/ indique “ChatGPT Desktop app for Mac is coming soon.” sur ce site https://www.numerama.com/tech/1742738-chatgpt-comment-telecharger-lapplication-sur-son-ordinateur.html ils donnent un lien vers le “.dmg” pour installer GPT-4o. Si vous êtes GPT-Plus vous avez sûrement + de chances d’y avoir déjà accès. Il parait qu’il faut essayer de se connecter plusieurs fois en cas d’échec pour avoir une chance. De mon côté j’ai ce message quand j’essaye de me connecter, même après plusieurs tentatives (j’ai ressayé ce jeudi après-midi) : “Coming Soon” “You do not have access to this app yet!” (je ne suis pas GPT-Plus, c’est peut-être pour ça : ( Si ça marche chez vous, n’hésitez pas à prévenir. Voici le pricing de la version payante mais je ne sais pas ce qu’elle apporte de + (à part + de tokens ?) Model gpt-4o Input 5,00 $US / 1M tokens Output 15,00 $US / 1M tokens [ Google Gemini ] dispo sur gemini.google.com Gémini Advanced : 0 € pendant 2 mois, puis 21,99 €/mois Également inclus dans cet abonnement Google One. Gemini dans Gmail, Docs et d’autres services 2 To d’espace de stockage pour Google Photos, Drive et Gmail Autres avantages premium Google One “Partagez votre forfait avec cinq personnes maximum” “Vos photos, fichiers et e-mails restent privés, même si vous partagez votre espace de stockage.” Je vais comparer avec le forfait Google One actuel. Mais si vous avez des infos hésitez pas à partager. Avez-vous testé le nouveau Gemini Advanced ? C’est vraiment mieux que le Gemini “normal” ? [ Comparatif GPT-4o / Gemini / Microsoft Copilot ] Même si c’est récent, avez-vous pu comparer les 3 ? Pour générer du code vous conseillez quelle “iA” ? (du code basique). J’avais été déçu par Gemini il y a quelques temps et préférais ChatGPT dont les tokens semblaient + longs au 1er prompt (version gratuite). Merci !
Komunitas
lemmy.world
Agent 1: Mistral Vibe Overall rating: 4/10 This version got many of the basics right but left out chording and didn’t perform well on the small presentational and “fun” touches. Agent 2: OpenAI Codex Overall: 9/10 The implementation of chording and cute presentation touches push this to the top of the list. We just wish the “fun” feature was a bit more fun. Agent 3: Anthropic Claude Code Overall: 7/10 The lack of chording is a big omission, but the strong presentation and Power Mode options give this effort a passable final score. Agent 4: Google Gemini CLI Overall: 0/10 (Incomplete) Final verdict OpenAI Codex wins this one on points, in no small part because it was the only model to include chording as a gameplay option. But Claude Code also distinguished itself with strong presentational flourishes and quick generation time. Mistral Vibe was a significant step down, and Google CLI based on Gemini 2.5 was a complete failure on our one-shot test. While experienced coders can definitely get better results via an interactive, back-and-forth code editing conversation with an agent, these results show how capable some of these models can be, even with a very short prompt on a relatively straightforward task. Still, we feel that our overall experience with coding agents on other projects (more on that in a future article) generally reinforces the idea that they currently function best as interactive tools that augment human skill rather than replace it.
Komunitas
lemmy.ml
Ja, genau, der Markdown-artige HTML-Ersatz heißt Gemtext. Also wirklicher Vorteil ist bestenfalls, dass es simpel ist. Hier ist ein Beispiel-Austausch in Gemini: Client schickt: gemini://example.com/ Server schickt: 20 text/gemini # Example Title Welcome to my Gemini capsule. * Example list item => gemini://link.to/another/resource Link text Alles nach der ersten Zeile des Servers ist schon Inhalt. Das ist eben so simpel, dass es komplexer wäre, wenn du einen bestehenden HTTP-Browser anpassen würdest, um zusätzlich Gemtext zu unterstützen. Auch mMn sehr eindrücklich ist, dass das hier die vollständige, formale Definition des Gemini-Protokolls ist, und das hier die vollständige, formale Definition von Gemtext. Das ist 'ne nette Nachmittagslektüre. Also ja, HTTP ist im Kern auch nicht wahnsinnig komplex, aber wenn man alle Eventualitäten unterstützen will, dann wird es doch schon komplexer. Mal von der Technologie abgesehen, und wo es auch extrem subjektiv wird, ob man das als Vorteil ansieht, ist Gemini eben auch bewusst ein bisschen abgegrenzt vom World Wide Web (mit HTTP und HTML). Stattdessen baut es sich seine eigene kleine Welt (“Geminispace”) auf, die bewusst nicht alle Möglichkeiten des World Wide Web bereitstellt. Zitat z.B. von der Gemini-Webseite: Gemini isn’t about innovation or disruption, it’s about providing some respite for those who feel the internet has been disrupted enough already. We’re not out to change the world or destroy other technologies. We are out to build a lightweight online space where documents are just documents, in the interests of every reader’s privacy, attention and bandwidth. Was in der Praxis daraus gemacht wird, ist natürlich nochmal was anderes. Also gibt durchaus mehr-oder-weniger-ernst gemeinte Stimmen, die von “Burn the Web” sprechen, weil halt so viel des World Wide Webs kommerzialisiert ist. Spätestens jetzt mit den ganzen KIs, die alles zu-spammen, was irgendwie monetarisiert werden kann, passiert das ja auch schon teilweise von selbst. Und dann gibt es eben auch mittlerweile eine kleine Community, die sich im Geminispace findet. Der kleinste, gemeinsame Nenner bei Interessen ist eben Technologie und Nachhaltigkeit, aber man kann auch alles mögliche an Blogs dort lesen und darüber dann die Leute kennen lernen.
Komunitas
lemmy.ml
I agree with you sentiment here. That’s why I wrote ‘relative terms’ in my comment. Since Nadela took over, Microsoft did some open thing which benefited community. So, Microsoft opened somewhat. During the same time, under Pichai, google went the other way: they focus more on monetization and try to control stuff the apple way. Manifest v3? Google also didn’t do anything really worth mentioning in the last 10y in terms of products. Well, except ‘attention’ article. And even this they didn’t believe in and they cannot deliver a decent product. I just tried google advanced Gemini and it’s, to put it politely, shit. Google also had some positive actions like mainlining a lot of stuff in Linux Kernel to more easily upgrade android. So, while google is closing down and making mistakes, Microsoft is opening a bit up. If you look the state from the last year and the state now. Microsoft improved. Google went the other way. Microsoft doesn’t care about open source, they care about the money Cloud Services using open source bring them. I don’t think google cares as well. For reason read this: https://www.joelonsoftware.com/2002/06/12/strategy-letter-v/
Komunitas
lemmy.ca
Talk about feeling old I played almost all the games until 2006, but fewer of the list after and hardly from the 2016 one. Not trying to one up but figured good place to comment. It’s been a fun time seeing progression over the years, haven’t stopped playing since my first Gemini (Atari 2600 knockoff that played the same games) system. Going from 2d squares to a proper video representation has been awesome. I always enjoyed the graphic improvements the gameplay has had its ups and downs but the remembered ones did that part right.
Komunitas
lemmy.saik0.com
And here I am with a 5 server cluster, 2x custom servers running opnsense for redundancy (8gbps internet connection needs real horsepower for IDS/firewall/routing), and a 36 bay storage truenas node… that’s getting upgraded to 72 bay version for more drives (34 additional drives ready for install RIGHT NOW)… I see your 50 and 38 W… and raise you This 2200-ish watts? Oh… and cooling the servers to keep them to about 75 degrees intake temp. So really closer to 3400 watts. Taking your numbers of 6 watts saved per drive would only save me 180w currently and 432w after I install the additional 32 drives next week. I’d still be in the 3kW territory. … I also have solar… I generate (orange) enough to export (purple) a little during the day… but that’s about it… Battery (light green) usage just kills peak hours. The electrical usage costs me about $100-110 a month in electricity after solar ($0.06 per kWh), probably closer to $150 if solar wasn’t eating up a bunch of it. Less than subscriptions to all the shit that I’m hosting for myself by a long long shot. Forget the family and other users. ::: spoiler More information… Not even remotely necessary to read. Nextcloud - 5TB, google drive is $10/mo for 2TB MSTY - AI stuff, another $10/mo subscription if you want google gemini. $20 for ChatGPT. Minecraft - private, $5 a month minimum. Probably closer to $10 for reasonable specs to do anything with the kiddos. Email - 1TB across all users right now, ~$5 minimum for just me, though I’m oversize for many platforms as I have everything going back to 2006 or so. So probably close to $8-10 for just me. Private search aggregator - apparently a paid service now with the likes of kagi. $10 Home assistant - $6.50 through nobucasa. $46-66 on this stuff alone… Frigate… 8 cameras with corals for inferencing. God know what that cost would be. I keep 30 days of 24 hours, 6 months of detected items and 1 year of snapshots. I’m at 50TB of usage there. This probably could/should be cut down significantly, at least halved. but even 25TB is a fuck-ton of money per month on any VPS/hosted system. Ring’s plan is about half what I’m doing at $20/mo. No idea what other services would end up being. Not even sure how ring and other make money at that cost when storage is expensive otherwise. Paperless-ngx, lubelog, grocy, gramps for organization/documentation would need a VPS service… or migrating to a non-hosted solution (so can’t really be shared easily, or shared through google docs sort of thing). Self-hosted things like lemmy, mastodon, matrix, peertube, etc… VPS costs would be something substantial as well. And business operation stuff like my invoices, jump hosts, secure vms, etc… And lastly, the cost of owning my own data… where no company can spy on me. Or monetize me in ads. Invidious, my own dns with custom rules for me vs the kids, etc…etc…etc… Priceless. Then multiply the parts of the list for other users on my system (wife, both kids, father, etc…) And of course the massive porn collection… Gotta have that on a moments notice. :::
Komunitas
lemmy.dbzer0.com
This is my big concern. Right now Gemini is an option you can switch on to replace the existing assistant, which I expect has similar terms. But how long will it be until Google just integrates this with their email, search, and online office suite with no options to disable it? They’ll tout it as an improvement and new features. Microsoft at least has to cater to business customers, so there will be options for systems administrators to opt-out for longer. With their government contracts they will have to prove adequate security. I still don’t like the AI push, or Microsoft as a whole, but I trust them not to have a data leak, or to sell business data to whoever. They don’t have overwhelming financial incentives in advertising or data collection for it, just normal sized incentives. On the other hand, Google’s biggest revenue stream is advertising, and that works due to the absurd amount of non-paying users they have with their free services. They have no business or financial incentives whatsoever to not just offer all this data they collect up on a silver platter. No incentives not to train horrible dystopian AI to maximize advertising effectiveness through A/B testing specific market/interest groups on an unimaginable scale. Google also has a history of collecting more data than they were allowed to, pinning it on a “rogue employee enabling a feature they were told to disable” when they are caught, and then proceeding to use that data anyway for their projects after the news dies down. I’ve always wanted to see a true “AI” personal assistant, leveraging tech to make lives easier, but this shit is not the way.
Komunitas
lemmy.sdf.org
cross-posted from: https://lemmy.sdf.org/post/31525284 Archived […] While the financial, economic, technological, and national-security implications of DeepSeek’s achievement have been widely covered, there has been little discussion of its significance for authoritarian governance. DeepSeek has massive potential to enhance China’s already pervasive surveillance state, and it will bring the Chinese Communist Party (CCP) closer than ever to its goal of possessing an automated, autonomous, and scientific tool for repressing its people. Since its inception in the early 2000s, the Chinese surveillance state has undergone three evolutions. In the first, which lasted until the early 2010s, the CCP obtained situational awareness — knowledge of its citizens’ locations and behaviors — via intelligent-monitoring technology. In the second evolution, from the mid-2010s till now, AI systems began offering authorities some decision-making support. Today, we are on the cusp of a third transformation that will allow the CCP to use generative AI’s emerging reasoning capabilities to automate surveillance and hone repression. […] China’s surveillance-industrial complex took a big leap in the mid-2010s. Now, AI-powered surveillance networks could do more than help the CCP to track the whereabouts of citizens (the chess pawns). It could also suggest to the party which moves to make, which figures to use, and what strategies to take. […] Inside China, such a network of large-scale AGI [artificial general intelligence] systems could autonomously improve repression in real time, rooting out the possibility of civic action in urban metropolises. Outside the country, if cities such as Kuala Lumpur, Malaysia — where China first exported Alibaba’s City Brain system in 2018 — were either run by a Chinese-developed city brain that had reached AGI or plugged into a Chinese city-brain network, they would quietly lose their governance autonomy to these highly complex systems that were devised to achieve CCP urban-governance goals. […] As China’s surveillance state begins its third evolution, the technology is beginning to shift from merely providing decision-making support to actually acting on the CCP’s behalf. […] DeepSeek […] is this technology that would, for example, allow a self-driving car to recognize road signs even on a street it had never traveled before. […] The advent of DeepSeek has already impelled tech experts in the United States to take similar approaches. Researchers at Stanford University managed to produce a powerful AI system for under US$50, training it on Google’s Gemini 2.0 Flash Thinking Experimental. By driving down the cost of LLMs, including for security purposes, DeepSeek will thus enable the proliferation of advanced AI and accelerate the rollout of Chinese surveillance infrastructure globally. […] The next step in the evolution of China’s surveillance state will be to integrate generative-AI models like DeepSeek into urban surveillance infrastructures. Lenovo, a Hong Kong corporation with headquarters in Beijing, is already rolling out programs that fuse LLMs with public-surveillance systems. In Barcelona, the company is administering its Visual Insights Network for AI (VINA), which allows law enforcement and city-management personnel to search and summarize large amounts of video footage instantaneously. […] The CCP, with its vast access to the data of China-based companies, could use DeepSeek to enforce laws and intimidate adversaries in myriad ways — for example, deploying AI police agents to cancel a Lunar New Year holiday trip planned by someone required by the state to stay within a geofenced area; or telephoning activists after a protest to warn of the consequences of joining future demonstrations. It could also save police officers’ time. Rather than issuing “invitations to tea” (a euphemism for questioning), AI agents could conduct phone interviews and analyze suspects’ voices and emotional cues for signs of repentance. […]
Komunitas
lemmy.sdf.org
cross-posted from: https://lemmy.sdf.org/post/31525284 Archived […] While the financial, economic, technological, and national-security implications of DeepSeek’s achievement have been widely covered, there has been little discussion of its significance for authoritarian governance. DeepSeek has massive potential to enhance China’s already pervasive surveillance state, and it will bring the Chinese Communist Party (CCP) closer than ever to its goal of possessing an automated, autonomous, and scientific tool for repressing its people. Since its inception in the early 2000s, the Chinese surveillance state has undergone three evolutions. In the first, which lasted until the early 2010s, the CCP obtained situational awareness — knowledge of its citizens’ locations and behaviors — via intelligent-monitoring technology. In the second evolution, from the mid-2010s till now, AI systems began offering authorities some decision-making support. Today, we are on the cusp of a third transformation that will allow the CCP to use generative AI’s emerging reasoning capabilities to automate surveillance and hone repression. […] China’s surveillance-industrial complex took a big leap in the mid-2010s. Now, AI-powered surveillance networks could do more than help the CCP to track the whereabouts of citizens (the chess pawns). It could also suggest to the party which moves to make, which figures to use, and what strategies to take. […] Inside China, such a network of large-scale AGI [artificial general intelligence] systems could autonomously improve repression in real time, rooting out the possibility of civic action in urban metropolises. Outside the country, if cities such as Kuala Lumpur, Malaysia — where China first exported Alibaba’s City Brain system in 2018 — were either run by a Chinese-developed city brain that had reached AGI or plugged into a Chinese city-brain network, they would quietly lose their governance autonomy to these highly complex systems that were devised to achieve CCP urban-governance goals. […] As China’s surveillance state begins its third evolution, the technology is beginning to shift from merely providing decision-making support to actually acting on the CCP’s behalf. […] DeepSeek […] is this technology that would, for example, allow a self-driving car to recognize road signs even on a street it had never traveled before. […] The advent of DeepSeek has already impelled tech experts in the United States to take similar approaches. Researchers at Stanford University managed to produce a powerful AI system for under US$50, training it on Google’s Gemini 2.0 Flash Thinking Experimental. By driving down the cost of LLMs, including for security purposes, DeepSeek will thus enable the proliferation of advanced AI and accelerate the rollout of Chinese surveillance infrastructure globally. […] The next step in the evolution of China’s surveillance state will be to integrate generative-AI models like DeepSeek into urban surveillance infrastructures. Lenovo, a Hong Kong corporation with headquarters in Beijing, is already rolling out programs that fuse LLMs with public-surveillance systems. In Barcelona, the company is administering its Visual Insights Network for AI (VINA), which allows law enforcement and city-management personnel to search and summarize large amounts of video footage instantaneously. […] The CCP, with its vast access to the data of China-based companies, could use DeepSeek to enforce laws and intimidate adversaries in myriad ways — for example, deploying AI police agents to cancel a Lunar New Year holiday trip planned by someone required by the state to stay within a geofenced area; or telephoning activists after a protest to warn of the consequences of joining future demonstrations. It could also save police officers’ time. Rather than issuing “invitations to tea” (a euphemism for questioning), AI agents could conduct phone interviews and analyze suspects’ voices and emotional cues for signs of repentance. […]
Komunitas
lemmy.sdf.org
Archived […] While the financial, economic, technological, and national-security implications of DeepSeek’s achievement have been widely covered, there has been little discussion of its significance for authoritarian governance. DeepSeek has massive potential to enhance China’s already pervasive surveillance state, and it will bring the Chinese Communist Party (CCP) closer than ever to its goal of possessing an automated, autonomous, and scientific tool for repressing its people. Since its inception in the early 2000s, the Chinese surveillance state has undergone three evolutions. In the first, which lasted until the early 2010s, the CCP obtained situational awareness — knowledge of its citizens’ locations and behaviors — via intelligent-monitoring technology. In the second evolution, from the mid-2010s till now, AI systems began offering authorities some decision-making support. Today, we are on the cusp of a third transformation that will allow the CCP to use generative AI’s emerging reasoning capabilities to automate surveillance and hone repression. […] China’s surveillance-industrial complex took a big leap in the mid-2010s. Now, AI-powered surveillance networks could do more than help the CCP to track the whereabouts of citizens (the chess pawns). It could also suggest to the party which moves to make, which figures to use, and what strategies to take. […] Inside China, such a network of large-scale AGI [artificial general intelligence] systems could autonomously improve repression in real time, rooting out the possibility of civic action in urban metropolises. Outside the country, if cities such as Kuala Lumpur, Malaysia — where China first exported Alibaba’s City Brain system in 2018 — were either run by a Chinese-developed city brain that had reached AGI or plugged into a Chinese city-brain network, they would quietly lose their governance autonomy to these highly complex systems that were devised to achieve CCP urban-governance goals. […] As China’s surveillance state begins its third evolution, the technology is beginning to shift from merely providing decision-making support to actually acting on the CCP’s behalf. […] DeepSeek […] is this technology that would, for example, allow a self-driving car to recognize road signs even on a street it had never traveled before. […] The advent of DeepSeek has already impelled tech experts in the United States to take similar approaches. Researchers at Stanford University managed to produce a powerful AI system for under US$50, training it on Google’s Gemini 2.0 Flash Thinking Experimental. By driving down the cost of LLMs, including for security purposes, DeepSeek will thus enable the proliferation of advanced AI and accelerate the rollout of Chinese surveillance infrastructure globally. […] The next step in the evolution of China’s surveillance state will be to integrate generative-AI models like DeepSeek into urban surveillance infrastructures. Lenovo, a Hong Kong corporation with headquarters in Beijing, is already rolling out programs that fuse LLMs with public-surveillance systems. In Barcelona, the company is administering its Visual Insights Network for AI (VINA), which allows law enforcement and city-management personnel to search and summarize large amounts of video footage instantaneously. […] The CCP, with its vast access to the data of China-based companies, could use DeepSeek to enforce laws and intimidate adversaries in myriad ways — for example, deploying AI police agents to cancel a Lunar New Year holiday trip planned by someone required by the state to stay within a geofenced area; or telephoning activists after a protest to warn of the consequences of joining future demonstrations. It could also save police officers’ time. Rather than issuing “invitations to tea” (a euphemism for questioning), AI agents could conduct phone interviews and analyze suspects’ voices and emotional cues for signs of repentance. […]
Komunitas
infosec.pub
Google can’t make a move in 2025 without veering into the realm of generative AI, and the release of the Pixel 9a is no exception. Curiously, the AI experience on this phone may not match what you’ve seen from the company’s high-end smartphones. Google has confirmed to Ars that the phone’s lower memory prevented it from implementing the full suite of Pixel AI features. You can still talk to Gemini by holding the power button or opening the Gemini app, but the on-device Gemini Nano model has seen a downgrade on the 9a.
Komunitas
europe.pub
cross-posted from: https://programming.dev/post/26984046 All analyzed AI chatbot apps collect some form of user data. The average number of collected types of data is 11 out of a possible 35 for the analyzed apps. 40% of the apps collect users’ locations. Additionally, 30% of these apps track user data. Tracking refers to linking user or device data collected from the app with third-party data for targeted advertising or advertising measurement purposes or sharing it with a data broker. Google Gemini collects the most information, gathering 22 out of 35 possible data types. This includes precise location data, which only Gemini, Copilot, and Perplexity collect. Gemini also collects a significant amount of data across various other categories, such as contact info (name, email address, phone number, etc.), user content, contacts (such as a list of contacts in the user’s phone), search history, browsing history, and several other types of data. This extensive data collection may be seen as excessive and intrusive by those concerned about data privacy and security. ChatGPT collects 10 types of data, such as contact information, user content, identifiers, usage data, and diagnostics, while avoiding tracking data or using third-party advertising within the app. While ChatGPT collects chat history, it is possible to use temporary chats, which auto-delete all data after 30 days, or to request the removal of personal data from training sets. Overall, ChatGPT collects slightly fewer types of data than some other analyzed apps, but users should still review the privacy policy to understand how this data is used and protected. Copilot, Poe, and Jasper are the three apps that collect data used to track you. This data could be sold to data brokers or used to display targeted advertisements in your app¹. While Copilot and Poe only collect device IDs, Jasper collects device IDs, product interaction data, advertising data, and other usage data, which refers to “any other data about user activity in the app”. DeepSeek’s data collection practices stand comfortably in the middle ground among other AI chatbot apps. DeepSeek collects 11 unique types of data, such as user input, including chat history, and claims to retain information for as long as necessary, storing it on servers located in the People’s Republic of China. Don’t let your guard down, as chats stored on servers are always at risk of being breached. According to The Hacker News, DeepSeek has already experienced a breach where more than 1 million records of chat history, API keys, and other information were leaked. It is generally a good idea to be mindful of the information provided.
Komunitas
lemmy.world
Introduction Why does Google insist on making it’s assistant situation so bad? In theory, assistant should be the best it’s ever been. It’s better at “understanding” what I ask for, and yet it less capable than ever to do so. This post is a rant about my experience using modern assistants on Android, and why, while I used to use these features actively in the mid-to-late-2010s, I now don’t even bother with them. The task Back in the late 2010s, I used to be able to hold the home button and ask the Google Assistant to create an event based on this email. It would grab the context from my screen, and do exactly that. This has been impossible, as far as I can tell, to do for years now. Trying to find the “right” assistant At some point, my phone stopped responding to “OK Google”. I still don’t know why it won’t work. Holding down the Home bar (the home button went the way of the dodo) brings up an assistant-style UI, but it’s dumb as bricks and only Googles the web. Useless. So, I installed Gemini. I asked it to perform a basic task. It responded “in live mode, I cannot do that”. Asking it how I can get it to create me a calendar event, it could not answer the question. Saying instead to open my calendar app and create a new event. I know how to use a calendar. I want it to justify its existence by providing more value than a Google search. It was ultimately unable to answer the question. Searching the internet, apparently both of the ways I had been using assistant features were the wrong way to do it. You have to hold down the power button, that’s how to launch the proper one. My internal response was: No, that’s for the power menu. I don’t want to dedicate it to Assistant. Well, apparently, that’s the only way to do it now, so there I go sacrificing another convenience turning it on. Pulling teeth with Gemini So I ask this power-menu-version of Gemini to do the same simple task. I tried 4 separate times. First, it created a random event “Meeting with a client” on a completely different day (what?). Second time it just crashed with an error. The third time, it asked me which email to use, giving me a list, but that list did not contain the email I was interested in. I asked it to find the Royal Mail one. No success. So, quite clearly, it wasn’t using screen content. I rephrased the question: “Please create an event from the content on my screen”. It replied “Sure, when’s this for?” I shouldn’t have to tell you. That’s the point. It’s right there. Conclusion There are too many damn assistant versions, and they are all bad. I can’t even imagine what it’s like to also have Bixby in the mix as a Samsung user. (Feel free to let me know below.) It seems like none of them are able to pull context from what you are doing anymore, and you’ll spend more time fiddling and googling how to make them work than it would take for you to do the task yourself. In some ways, assistants have gotten worst than almost 10 years ago, despite billions in investments. As a little bonus, the internet is filled with AI slop that makes finding out real facts, real studies from real people harder than ever. I write this all mostly to blow off steam, as this stuff has been frustrating me for years now. Let me know what your experience has been like below, I could use some camaraderie.
Komunitas
beehaw.org
archive link The company has announced an expansion of its AI search features, powered by Gemini 2.0. Everyone will soon see more AI Overviews at the top of the results page, but Google is also testing a more substantial change in the form of AI Mode. This version of Google won’t show you the 10 blue links at all—Gemini completely takes over the results in AI Mode. … If this sounds like something you absolutely do not want, you can safely ignore it for now. The experimental feature is only available for Google One AI Premium subscribers, who pay $20 per month for access to Google’s best LLMs. This could be an indication that generating these search pages is extremely costly even for a company that gives away so much AI processing for free. Still, Google’s AI efforts move fast, and you could find yourself confronted with AI Mode soon. It only took a few months for the Search Generative Experience to graduate from Labs as AI Overviews. from the primary source on Google’s own blog: As we’ve rolled out AI Overviews, we’ve heard from power users that they want AI responses for even more of their searches. uh-huh. sure. “power users” have been banging down your door and insisting they want more search results that say it’s safe to eat one rock per day. and these “power users” have apparently also been demanding that Google remove the normal search results that appear below the AI-generated slop.
Komunitas
lemmy.world
Increasing Gemini app usage is said to be Google’s “biggest focus” in 2025, with the company recently buying podcast ads to drive downloads.
Komunitas
lemmy.ml
Kagi Assistant adds Sonnet 3.7 and the preview of multi-step reasoning assistant called Ki We’re happy to unveil the latest updates to our assistant experience. This release brings a smoother, smarter, and more intuitive interface designed to make your interactions simpler. Try it out and feel the difference! We also added Claude 3.7 Sonnet to our model lineup! It’s now powering our !code assistant and plays a key role in our advanced multi-step reasoning model, Kii. Currently in early testing, Ki is available exclusively to our Discord community members - join now to get early access (Ultimate account users only)! And today OpenAI released GPT4.5 - we have already benchmarked it. Check Kagi LLM benchmark. TikTok video search Video search just got even sharper—you can now filter specifically for Tiktok videos. Find exactly what you’re looking for, faster. We are hiring a Flutter developer! We just opened a position for a skilled Flutter Developer. Apply now or send someone our way. Improvements and bug fixes Search Show tiktok as a video source in video search #6356 @Thibaultmol Kagi Search: Using “Quick Answer” on a search query that has an ampersand (&) will ignore the stuff after the ampersand #6085 @__ Unclear Privacy Policy on Search Logging #5276 @UnderdogSquad Currency conversion, i.e., “100 usd to sek”, should display widget #2031 @Vapid Outage of summarize feature in france #6314 @TarekLazaar Low quality translated Reddit results #5212 @bram Summarizer doesn’t work with WalletHub #6324 @yokoffing User submitted themes #6332 @Kanx Search switching to an earlier search on it’s own #6286 @Boomkop3 Errant summary section when conducting specific search #6174 @linuxpng Kagi Search: When the search suggestions are open, pressing Enter won’t search #6147 @__ FastGPT bangs (!fast and !fgpt) redirect to Kagi Search instead of Kagi FastGPT #6167 @doggofan Can’t search for things beginning with capital I #6308 @Ludwik Summarizer can’t capture Youtube tittles #6281 @sleepysnooze Improve performance results page #5250 @Thibaultmol Can’t apply region with keyboard #6185 @goulot_situez Kagi Search: “More” button is misaligned when using the “Larger” font #6063 @__ Unclickable link on Wikipedia sidebar widget #6266 @RoxyRoxyRoxy “exact match” for reverse image search #6036 @Thibaultmol Publish date for news search is in a language different from the interface language #5437 @huey CSS toggle and Status removed from the Control Center in Assistant #6250 @Temanor Images widget blank on mobile #6243 @tuesday Asking “WNBA standings?” in kagi gets everything wrong #4245 @alexisd New York is a European city according to Kagi Quick Answer #4378 @Repacking6528 “Quick Answer” not recognizing outdated information #5086 @Kagibeh Same image appears in any search regardless of terms #6382 @dabluecaboose Assistant Ability to download Assistant response files Asking AI on Kagi-ChatGPT (for example) is different from asking ChatGPT directly. Why is this? #6136 @Paato Can’t use ‘Gemini 2.0 Flash’ #6302 @jyyen Claude Efficiant Tool Usage Beta #6364 @HRA42 Assistant “Document Size Exceeded” Error for 4mb picture #6370 @blackbird2150 Adjust Saved Code Snippet File Extension #6378 @IsaacThoman “Ask questions about page” doesn’t open Assistant #6108 @mullinggroove Mac Kagi Assistant shortcuts not working #6343 @miicat_47 Make title of Assistant reference a hyperlink #6350 @RoxyRoxyRoxy Kagi Assistant Downloads as .html #6207 @lloyd094 Translate Try Kagi Translate at https://translate.kagi.com/ If on ‘manual’ translate mode, do a translate when the user changes target language #6037 @Thibaultmol Improve Korean Localization #5305_334, #5305_329 @Hanbyeol Don’t show alternative translations in JP, if the only thing that’s changed from original is the pronunciation being added. #5305_328 @frin JP romanization is sometimes incorrect/overly formal #5305_328 #5305_323 @frin German (Switzerland) translations using the wrong “ss” #5305_314 @psy-q Translated text sometimes put into quotation marks #5305_306 @nichu42 Disable/export/delete translation history #5305_308 @ajimix CTRL+A should only select the translated text, not whole page #5305_316 @RoxyRoxyRoxy Add Hawaiian to list of supported languages #5305_320 @Bradh Change URL params when translation is completed #5305_323 @frin While proofreading hindi, strike/correct whole diacritic / ligature. @aldehyde.8578 Document translation: translate your .doc(x), .txt and .csv documents Alternative translations: along the main translation, show multiple translation options Word insights: fine-tune individual words or phrases https://translate.kagi.com/iso_code now redirects to the main page, with the to language being set to [iso_code]. Make tooltips appear immediately, adjust style Dynamic text size depending on length in input/output box Increase max size of request header to allow longer romanization requests [Forgot to credit last time] Add translation context field #5305_253 @Kate-Karui Kagi on Socials This week’s featured social media post: Tag our accounts or use #Kagi when mentioning us in your posts! Kagi in the News The Verge covered Kagi’s recent Privacy Pass feature. Privacy Pass was also mentioned on multiple podcasts, including Enterprise Security Weekly and Techlore’s Surveillance Report. Interested in covering Kagi on your outlet, newsletter or podcast? Hit us up! Our team is very approachable and we welcome any opportunity to engage with various communities about our latest features.
Komunitas
lemmy.zip
My favorite way to use it has to be the Nuke + Gemini Turret strategy. Toss one nuke at the enemy, drop the other right in front of yourself, immediately pack it back up, wait for cooldown, repeat.
Komunitas
lemmy.ml
Announcing Kagi Privacy Pass Kagi now supports Privacy Pass, an IETF-standardized protocol ensuring your searches are technically unlinkable to your account. Read all about why this matters and the details of implementation in our announcement blog post! Privacy Pass support is provided: Natively for Orion browser users (macOS/iOS/iPadOS). On iOS, make sure to have version 1.3.17 and above (expected to roll out globally today) and update your macOS Orion to version 0.99.131. Natively through Kagi App for Android (make sure to have version 0.29, expected to roll out globally today) Browser extension for Firefox or Chrome A few important details: Privacy Pass implementation is fully open-sourced for transparency and community collaboration Kagi Privacy Pass is available for the Professional, Ultimate, Family, and Team plans. Limitations: It’s not available for Trial/Starter plans Privacy Pass mode disables account-specific features, like domain personalisaton (as we do not know which user is searching) Privacy Pass mode is supported for Kagi Search in this initial phase, we will add support other services in the coming weeks. Check the blog post FAQ section for more details! Kagi is redefining privacy in search. Try it now! Kagi Tor Onion service Access Kagi securely and anonymously via our new Tor Onion Service. You can now access Kagi directly through the Tor network using our dedicated onion address: kagi2pv5bdcxxqla5itjzje2cgdccuwept5ub6patvmvn3qgmgjd6vid.onion See more information about Kagi Tor service in our documentation. An updated comprehensive privacy policy We’re also excited to share our updated privacy policy, with simplified language and designed to clearly reflect our strong commitment to protecting your privacy. Improvements and bug fixes Search Quick Peek Breaks after Clicking More Results #6016 @silvenga Add option to save the mobile advanced settings state #5826 @miicat_47 Analytics for kids accounts do not show name/username #6189 @kingtong News search sorting issue #6187 @Alastair Kagi maps finds Texas in Alaska #5904 @Boomkop3 News Search has no headings, not accessible to blind users #6140 @gordon582 Escaping
in IP Widget #6225 @silvenga Ranking adjustment info popup contains raw HTML #6241 @frereit Quick answer code snippets display broken (sometimes) #6224 @lelovsky Images widget blank on mobile #6243 @tuesday Kagi Assistant Kagi Assistant now supports O3 Mini, Gemini 2.0 Flash, and R1 Distill Llama 70B! Announce new results from assistant using ARIA Live #5996 @fastfinge Allow to hide thought process for reasoning LLMs #6025 @KamilKurde Add o3-mini model to Kagi Assistant #6135 @RegChien Formatting issues in Reasoning for R1 in Assistant #6148 @hansihe Assistant should wait with naming a thread until it’s finished if not enough info #5828 @Thibaultmol Space missing below reference list in Assistant downloaded transcript #5833 @thomasjsn Allow for disabling the CTRL + SHIFT + Backspace shortcut #5550 @4P5mc Images cannot be pasted into the Assistant on WebKit browsers #6193 @laiz Info panel not visible for new Assistant UI #6143 @azdanov The assistant prompt “think about tag” breaks auto-hiding of reasoning text #6181 @ssg Kagi Android App Privacy Pass support! You can add the Privacy Pass shortcut by holding the Kagi Android icon. The app returns to the main screen every time it re-renders (e.g., entering/leaving split screen) #5875 @Philippe_Choquette Assistant via Android app doesn’t work with multi file uploads #5959 @Thibaultmol Kagi on Socials Here is this week’s featured social media post: Tag our accounts or use #Kagi when mentioning us in your posts! Kagi in the News Orion tops Apple’s App Store’s list of superpowered internet browsers “to seriously level up your web browsing”! And Android Police published an article about Kagi’s new fair pricing model: “This ethical search engine will return your subscription money if you don’t use it.” The Verge also covered the news.