🤖 Happy FOSAI Friday! 🚀
Friday, September 29, 2023
HyperTech News Report #0002
Hello Everyone!
Welcome back to the HyperTech News Report! This week we’re seeing some really exciting developments in futuristic technologies. With more tools and methods releasing by the day, I feel we’re in for a renaissance in software. I hope hardware is soon to follow… but I am here for it! So are you. Brace yourselves. Change is coming! This next year will be very interesting to watch unfold.
Table of Contents
Community Changelog
- Cleaned up some old content (let me know if you notice something that should be archived or updated)
Image of the Week
This image of the week comes from a DALL-E 3 demonstration by Will Depue. This depicts a popular image for diffusion models benchmarks - the astronaut riding a horse in space. Apparently this was hard to get right, and others have had trouble replicating it - but it seems to have been generated by DALL-E 3 nevertheless. Curious to see how it stacks up against other diffusers when its more widely available.
New Foundation Model!
There have been many new models hitting HuggingFace on the daily. The recent influx has made it hard to benchmark and keep up with these models - so I will be highlighting a hand select curated week-by-week, exploring these with more focus (a few at a time).
If you have any model favorites (or showcase suggestions) let me know what they are in the comments below and I’ll add them to the growing catalog!
This week we’re taking a look at Mistral - a new foundation model with a sliding attention mechanism that gives it advantages over other models. Better yet - the mistral.ai team released this new model under the Apache 2.0 license. Massive shoutout to this team, this is huge for anyone who wants more options (commercially) outside of Llama 2 and Falcon families.
From Mistralai:
The best 7B, Apache 2.0… Mistral-7B-v0.1 is a small, yet powerful model adaptable to many use-cases. Mistral 7B is better than Llama 2 13B on all benchmarks, has natural coding abilities, and 8k sequence length. It’s released under Apache 2.0 licence, and we made it easy to deploy on any cloud.
Mistralai
- https://huggingface.co/mistralai/Mistral-7B-v0.1
- https://mistral.ai/news/announcing-mistral-7b/
- https://docs.mistral.ai/quickstart/
TheBloke (Quantized)
- https://huggingface.co/TheBloke/Mistral-7B-v0.1-GGUF https://huggingface.co/TheBloke/Mistral-7B-v0.1-GPT
More About GPTQ
More About GGUF
Metaverse Developments
Mark Zuckerberg had his third round interview on the Lex Fridman podcast - but this time, in the updated Metaverse. This is pretty wild. We seem to have officially left uncanny valley territory. There are still clearly bugs and improvements to be made - but imagine the possibilities of this mixed reality technology (paired with VR LLM applications).
The type of experiences we can begin to explore in these digital realms are going to evolve into things of true sci-fi in our near future. This is all very exciting stuff to look forward to as AI proliferates markets and drives innovation.
What do you think? Zuck looks more human in the metaverse than in real life… mission… success?
Click here for the podcast episode.
NVIDIA NeMo Guardrails
If you haven’t heard about NeMo Guardrails, you should check it out. It is a new library and approach for aligning models and completing functions for LLMs. It is similar to LangChain and LlamaIndex, but uses an in-house developed language from NVIDIA called ‘colang’ for configuration, with NeMo Guardrail libraries in python friendly syntax.
This is still a new and unexplored tool, but could provide some interesting results with some creative applications. It is also particularly powerful if you need to align enterprise LLMs for clients or stakeholders.
Tutorial Highlights
Mistral 7B - Small But Mighty 🚀 🚀
Chatbots with RAG: LangChain Full Walkthrough
NVIDIA NeMo Guardrails: Full Walkthrough for Chatbots / AI
Author’s Note
This post was authored by the moderator of !fosai@lemmy.world - Blaed. I make games, produce music, write about tech, and develop free open-source artificial intelligence (FOSAI) for fun. I do most of this through a company called HyperionTechnologies
a.k.a. HyperTech
or HYPERION
- a sci-fi company.
Thanks for Reading!
If you found anything about this post interesting, consider subscribing to !fosai@lemmy.world where I do my best to keep you informed about free open-source artificial intelligence as it emerges in real-time.
Our community is quickly becoming a living time capsule thanks to the rapid innovation of this field. If you’ve gotten this far, I cordially invite you to join us and dance along the path to AGI and the great unknown.
Come on in, the water is fine, the gates are wide open! You’re still early to the party, so there is still plenty of wonder and discussion yet to be had in our little corner of the digiverse.
This post was written by a human. For other humans. About machines. Who work for humans for other machines. At least for now…
Until next time!
I totally forgot to include vLLM!
If you’re building, deploying, or hosting LLMs, you should definitely check this out.
https://github.com/vllm-project/vllm