Sean Breeden :: Full Stack PHP, Python, AI/ML Developer


AI News 2/24/2024

Saturday, February 24th, 2024

Stable Diffusion 3: https://stability.ai/news/stable-diffusion-3 - Announcing Stable Diffusion 3 in early preview, our most capable text-to-image model with greatly improved performance in multi-subject prompts, image quality, and spelling abilities. While the model is not yet broadly available, today, we are opening the waitlist for an early preview. This preview phase, as with previous models, is crucial for gathering insights to improve its performance and safety ahead of an open release. You can sign up for the waitlist from the announcement page linked above.

Groq: https://wow.groq.com/ - Groq® created the LPU™ Inference Engine - the first and fastest of its kind - serving the real-time AI market. Our solution for inference (not training) makes us the AI performance leader, with regard to speed and precision, in the compute center. Unlike other providers, we aren't brokering a cloud service. We built our own chip, compiler and software, systems, and GroqCloud™. Our first-gen GroqChip™, a Language Processing Unit™ (LPU), is a new processor category.

Gemma: https://blog.google/technology/developers/gemma-open-models/ - Gemma is a family of lightweight, state-of-the-art open models built from the same research and technology used to create the Gemini models. Developed by Google DeepMind and other teams across Google, Gemma is inspired by Gemini, and the name reflects the Latin gemma, meaning “precious stone.” Accompanying our model weights, we’re also releasing tools to support developer innovation, foster collaboration, and guide responsible use of Gemma models.

Google Gemini: https://gemini.google.com/app - Google says its AI image-generator would sometimes ‘overcompensate’ for diversity. The partial explanation for why its images put people of color in historical settings where they wouldn’t normally be found came a day after Google said it was temporarily stopping its Gemini chatbot from generating any images with people in them. That was in response to a social media outcry from some users claiming the tool had an anti-white bias in the way it generated a racially diverse set of images in response to written prompts. (Source: AP News)

ElevenLabs: https://elevenlabs.io/ - AI voice cloning startup ElevenLabs was co-founded by former Google machine-learning engineer Piotr Dabkowski and ex-Palantir deployment strategist Mati Staniszewski in 2022, and has since launched AI-powered text-to-speech software and an AI dubbing tool designed to automatically translate speech from a video into more than 20 languages in a way that "maintains the original voice tone and style." Now the company is working on something new, which can reportedly generate sounds to accompany otherwise silent video footage based on a user's description of the scene. (Source: https://newatlas.com/technology/elevenlabs-sound-effects-ai-audio/)

(image generated by DALL-E on ChatGPT-4)
