AI Companions – Ryan Schultz

Editorial: Artificial Intelligence and the Metaverse (And a Look at an AI-Assisted Social VR Platform, Riff XR)

I created this image using OpenAI’s DALL-E generative AI art generation tool, using the text prompt “artificial intelligence in the metaverse” (source)

Housekeeping Note: I first started writing this editorial back in April, and from time to time I have picked up the draft, tinkered with it a bit more, added a bit more to it—and then promptly filed it away again as a draft, because I still wasn’t satisfied with it, and I always felt that I had something more to say.

Enough. I finally decided that the perfect was the enemy of the good, and I decided today to just go ahead and publish what I already had, and then write follow-up blogposts on the topic of AI in general, and AI in the metaverse in particular. And I do expect that I will return to this topic often! So please stay tuned.

I have written before on this blog about artificial intelligence (AI) applications, such as the image manipulation and animation tools WOMBO and Reface, the text-to-art creation programs DALL-E 2, Midjourney, and Stable Diffusion, and most recently, the AI-powered chatbot Replika and the text-generation app ChatGPT. Most people, myself included, treated them as toys, mere curiosities (I entertained myself for hours making my Second Life and Sansar avatars “come alive” using WOMBO). John Hermann, in a recent article for New York magazine titled The AI Magic Show (original; archived version), wrote:

In 2022, artificial-intelligence firms produced an overwhelming spectacle, a rolling carnival of new demonstrations. Curious people outside the tech industry could line up to interact with a variety of alluring and mysterious machine interfaces, and what they saw was dazzling.

The first major attraction was the image generators, which converted written commands into images, including illustrations mimicking specific styles, photorealistic renderings of described scenarios, as well as objects, characters, textures, or moods. Similar generators for video, music, and 3-D models are in development, and demos trickled out.

Soon, millions of people encountered ChatGPT, a conversational bot built on top of a large language model. It was by far the most convincing chatbot ever released to the public. It felt, in some contexts, and especially upon first contact, as though it could actually participate in something like conversation. What many users suggested felt truly magical, however, were the hints at the underlying model’s broader capabilities. You could ask it to explain things to you, and it would try — with confident and frequently persuasive results. You could ask it to write things for you — silly things, serious things, things that you might pass off as work product or school assignments — and it would.

As new users prompted these machines to show us what they could do, they repeatedly prompted us to do a little dirty extrapolation of our own: If AI can do this already, what will it be able to do next year?

As Charlie Wurzel writes in The Atlantic, in a recent article titled What Have We Just Unleashed? (original; archived version), not even the AI experts know exactly what will come next:

Over the past few weeks, I’ve put questions like these to AI researchers, academics, entrepreneurs, and people who are currently building AI applications. I’ve become obsessive about trying to wrap my head around this moment, because I’ve rarely felt less oriented toward a piece of technology than I do toward generative AI. When reading headlines and academic papers or simply stumbling into discussions between researchers or boosters on Twitter, even the near future of an AI-infused world feels like a mirage or an optical illusion. Conversations about AI quickly veer into unfocused territory and become kaleidoscopic, broad, and vague. How could they not?

The more people I talked with, the more it became clear that there aren’t great answers to the big questions. Perhaps the best phrase I’ve heard to capture this feeling comes from Nathan Labenz, an entrepreneur who builds AI video technology at his company, Waymark: “Pretty radical uncertainty.”

You might be surprised to learn that we librarians are already preparing for the AI onslaught (perhaps not surprising, as some of the users we serve have already come to us with fake, AI-generated citations to information resources which do not exist!). I have seen no less than three events for Canadian librarians on this subject in the past couple of weeks, and at my employer, the University of Manitoba in Winnipeg, a fellow librarian has launched an AI Community of Practice (CoP), which will meet regularly to discuss ChatGPT and other AI apps and topics.

Librarians meeting via Zoom at the inaugural meeting of the brand-new University of Manitoba Libraries’ AI Community of Practice (I’m in the top row, second from the left!)

You.com is one of the newer entrants in the AI-powered search engine marketplace, which also offers AI-generated text and images as well. You can actually try out YouWrite (their AI-generated text generator) for free ten times before you have to shell out for the service, so I decided to give it a whirl, asking the site to create a professional-sounding blogpost about the impact of artificial intelligence on the metaverse:

And here’s the blogpost YouWrite spit back, exactly as generated, with zero edits:

The advent of artificial intelligence (AI) has revolutionized the way we interact with technology. AI is now being used to create virtual worlds, or “metaverses”, that are populated by digital avatars and objects that can be manipulated in real-time. As AI continues to evolve, it will have a profound impact on the metaverse and its inhabitants.

One of the most significant impacts of AI on the metaverse will be increased realism for users interacting within these virtual worlds. With advanced machine learning algorithms and natural language processing capabilities, computer-generated characters can become more lifelike than ever before – responding naturally to user input in real time while exhibiting realistic emotions such as joy or sadness depending on their environment. This could potentially lead to an unprecedented level of immersion for those engaging with these digital realms as they become indistinguishable from reality itself!

Another major advantage brought about by artificial intelligence is improved efficiency when it comes to managing large numbers of players at once – something which becomes particularly important when dealing with massive multiplayer online games (MMOs). By utilizing intelligent agents capable of autonomous decision making based off data collected from other players’ actions, developers can ensure smooth gameplay even during peak hours without sacrificing any quality due performance issues caused by too many people playing at once!

Finally, one area where AI could really shine in regards to improving user experience within a metaverse would be through automated content creation tools powered by deep learning models trained using vast amounts existing assets found across different platforms such as video games or movies etcetera… Such systems would allow game designers/developers quick access powerful resources allowing them craft unique experiences much faster than was previously possible – thus providing gamers exciting new adventures every time they log into their favourite world!

In conclusion then; It’s clear see how Artificial Intelligence stands revolutionize our concept what constitutes an immersive experience both inside out Metaverses – giving us greater control over our interactive environments alongside richer storylines filled dynamic characters all thanks advances this cutting edge technology brings us closer achieving true Virtual Reality today tomorrow alike…

That last paragraph is absolutely dreadful (almost as if the AI had a stroke!), but the rest reads well, and the text contains a few somewhat plausible, if borderline-fanciful ideas, but if I could sum it all up in one word, that word would be: bullshit. This is computer-generated bullshit, trained on a large language model to sound like an actual human expert, but it’s just parroting human writing, without any grasp of the knowledge it is talking about! (I can’t remember who said it first, but somebody once memorably and hilariously referred to AI-generated text apps like ChatGPT as “mansplaining as a service.” 😜 In fact, I would go so far as to state that generative AI tools like ChatGPT offer white, cisgender, mansplaining as a service! All the biases in the mountains of data—scraped off the greater unwashed internet—used to train these tools sometimes comes out in their responses, despite the best efforts of the companies building them to eradicate these biases.)

Despite appearances, Chat GPT doesn’t really understand the world the way a human brain, with all of its lived experiences, does; it only understands how to generate plausible-sounding sentences and assemble them in coherent paragraphs! It’s a narrowly-defined problem, not general AI that is good at a variety of tasks, and certainly not a rival to humans.

Hermann, in his New York magazine article, paints a somewhat disquieting picture of what could happen in the future, as the AI wave accelerates:

Models trained on flawed, biased, and often secret sets of data will be used to attempt to perform an assuredly ambitious range of tasks, jobs, and vital economic and social processes that affect the lives of regular people. They will depend on access to massive amounts of computing power, meaning expensive computer hardware, meaning rare minerals, and meaning unspeakable amounts of electricity. These models will be trained with the assistance of countless low-paid labourers around the world who will correct bogus statistical assumptions until the models produce better, or at least more desirable, outputs. They will then be passed on for use in various other workplaces where their outputs and performances will be corrected and monitored by better-paid workers trying to figure out if the AI models are helping them or automating them out of a job, while their bosses try to figure out something similar about their companies. They will shade our constant submissions to the vast digital commons, intentional or consensual or mandatory, with the knowledge that every selfie or fragment of text is destined to become a piece of general-purpose training data for the attempted automation of everything. They will be used on people in extremely creative ways, with and without their consent.

Charlie Warzel goes even further, likening the potential impact of artificial intelligence to that of nuclear fission and nuclear war:

Trying to find the perfect analogy to contextualize what a true, lasting AI revolution might look like without falling victim to the most overzealous marketers or doomers is futile. In my conversations, the comparisons ranged from the agricultural revolution to the industrial revolution to the advent of the internet or social media. But one comparison never came up, and I can’t stop thinking about it: nuclear fission and the development of nuclear weapons.

As dramatic as this sounds, I don’t lie awake thinking of Skynet murdering me—I don’t even feel like I understand what advancements would need to happen with the technology for killer AGI [Artificial General Intelligence] to become a genuine concern. Nor do I think large language models are going to kill us all. The nuclear comparison isn’t about any version of the technology we have now—it is related to the bluster and hand-wringing from true believers and organizations about what technologists might be building toward. I lack the technical understanding to know what later iterations of this technology could be capable of, and I don’t wish to buy into hype or sell somebody’s lucrative, speculative vision. I am also stuck on the notion, voiced by some of these visionaries, that AI’s future development might potentially be an extinction-level threat.

ChatGPT doesn’t really resemble the Manhattan Project, obviously. But I wonder if the existential feeling that seeps into most of my AI conversations parallels the feelings inside Los Alamos in the 1940s. I’m sure there were questions then. If we don’t build it, won’t someone else? Will this make us safer? Should we take on monumental risk simply because we can? Like everything about our AI moment, what I find calming is also what I find disquieting. At least those people knew what they were building.

The point these authors are making is that, with AI, we are dealing with something which has the potential to dramatically impact (and, in some cases, up-end) our current society, in ways which might not be readily apparent at first.

Amy Castor and David Gerrard, who have been busy dissecting and critiquing the ongoing three-ring circus that is blockchain, crypto, and NFTs, have turned their attention to artificial intelligence, in a two-part series (part one; part two). I strongly suggest you read both blogposts, but here’s a sample:

Much like crypto, AI has gone through booms and busts, with periods of great enthusiasm followed by AI winters whenever a particular tech hype fails to work out.

The current AI hype is due to a boom in machine learning — when you train an algorithm on huge datasets so that it works out rules for the dataset itself, as opposed to the old days when rules had to be hand-coded.

ChatGPT, a chatbot developed by Sam Altman’s OpenAI and released in November 2022, is a stupendously scaled-up autocomplete. Really, that’s all that it is. ChatGPT can’t think as a human can. It just spews out word combinations based on vast quantities of training text — all used without the authors’ permission.

The other popular hype right now is AI art generators. Artists widely object to AI art because VC-funded companies are stealing their art and chopping it up for sale without paying the original creators. Not paying creators is the only reason the VCs are funding AI art.

Do AI art and ChatGPT output qualify as art? Can they be used for art? Sure, anything can be used for art. But that’s not a substantive question. The important questions are who’s getting paid, who’s getting ripped off, and who’s just running a grift.

…

OpenAI’s AI-powered text generators fueled a lot of the hype around AI — but the real-world use case for large language models is overwhelmingly to generate content for spamming. [Vox]

The use case for AI is spam web pages filled with ads. Google considers LLM-based ad landing pages to be spam, but seems unable or unwilling to detect and penalize it. [MIT Technology Review; The Verge]

The use case for AI is spam books on Amazon Kindle. Most are “free” Kindle Unlimited titles earning money through subscriber pageviews rather than outright purchases. [Daily Dot]

The use case for AI is spam news sites for ad revenue. [NewsGuard]

The use case for AI is spam phone calls for automated scamming — using AI to clone people’s voices. [CBS]

The use case for AI is spam Amazon reviews and spam tweets. [Vice]

The use case for AI is spam videos that advertise malware. [DigitalTrends]

The use case for AI is spam sales sites on Etsy. [The Atlantic, archive]

The use case for AI is spam science fiction story submissions. Clarkesworld had to close submissions because of the flood of unusable generated garbage. The robot apocalypse in action. [The Register]

You can confidently expect the AI-fueled shenanigans to continue.

Riff XR: Artificial Intelligence in the Metaverse

However, there have some rather interesting specific applications of AI to the metaverse. A brand-new social VR platform called Riff XR offers a tantalizing (if still somewhat buggy) glimpse of the AI-assisted metaverse of the future.

Among the AI-assisted features of Riff XR are NPC (non-playing characters, i.e. bots) with whom you can have surprisingly open-ended conversations, as well as a “cutting-edge Stable Diffusion-powered Generative Art System”:

Now, I have not visited Riff XR myself (yet), but a good friend of mine, metaverse videographer Carlos Austin, has, and he posted a video of his explorations on this new metaverse platform, including verbal conversations with a number of NPCs using generative AI to “listen” and “respond” to his spoken sentences.

One was a constable droid roaming the night-time central plaza in Riff XR, a scene straight out of Ready Player One; another played the role of Vincent Van Gogh in an exhibition of AI-generated artworks in a museum just off the plaza; a third was a woman, named Molly Millions, working at the back bar in a cyber-disco with pulsating music and gyrating NPCs of various kinds, with whom Carlos had a surprisingly in-depth conversation about cocktails!

Carlos demonstrated that you could even speak to these NPCs in different languages including German, Japanese, and Spanish (although let me just add, that the faux Van Gogh’s German accent was absolutely atrocious!). Here’s his full video (please fast-forward through all the technical bugs and mishaps; Riff XR is still quite buggy!). Carlos’ conversation with Molly Millions is nearer the end of this video:

We can expect to see more such applications of artificial intelligence coming soon (and perhaps sooner than we might expect!) to a virtual world or social VR platform near you. And you can expect more blogposts from me on this topic in future, as the technology continues to develop and evolve over time. Stay tuned!

Many thanks To Jim Carnicelli (a.k.a Galen from Sansar), with whom I had a couple of wide-ranging online discussions via Discord on the topic of AI while I was working on this blogpost over the summer! While I did not use many of the ideas we talked about, they did give me much food for thought (and possible topics for future blog posts!). You can visit Jim’s store selling his AI-generated artwork here: Snuggle Hamster Designs.

Future Trend: The Use of Artificial Intelligence Companions and Chatbots in the Metaverse (and I Decide to Test Out the Replika AI Chatbot)

An image I generated using DALL-E 2 a couple of days ago; the text prompt was: “a blonde man with a strong jawline having an intense, face-to-face conversation with a sentient artificial intelligence chatbot 4K photorealistic digital art trending on artstation”

Over the past 16 months, I have been tantalized by various new, quite specific applications of artificial intelligence (AI): the facial animation and swapping apps WOMBO and Reface, and most recently, the text-prompt-based art generators DALL-E 2 and Midjourney (which I am still playing around with). Today, I wanted to discuss the growing use of AI in the metaverse.

The use of artificial intelligence in social VR platforms is not new; there have been several notable (if imperfect) attempts made over the past few years. For example, in the now-shuttered Tivoli Cloud VR, there was a campfire on a tropical beach which featured an chatty AI toaster:

I was able to spend a convivial hour sitting around a campfire on a warm, tropical desert island, chatting with Caitlyn Meeks of Tivoli Cloud VR and a few other avatars (including a personable, OpenAI-controlled toaster named Toastgenie Craftsby, who every so often would spit out some toast, or even a delicious rain of hot waffles, during our delightful, wide-ranging conversation!).

Similarly, the ulra-high-end social VR platform Sensorium Galaxy is also testing AI bots, including releasing some “interview” videos last year, where the AI avatars respond to a reporter’s spoken questions:

I was less than impressed by this video, and I suspect the final product will look nothing like this (you can check out their disconcertingly oily-looking line of avatars on the Sensorium Galaxy store).

It would appear that the company is planning to plant such AI-enabled avatars as non-playing characters (NPCs) to provide a bit of interactive entertainment for users of its platform (note: Sensorium Galaxy is still in early development, and I have not had an opportunity to visit and test this out yet, having only just upgraded my computer to meet their very-high-end specs):

Even my brand-new personal computer doesn’t meet all of these recommended specs (I have an RTX 3070 GPU), and I notice that the Valve Index is not listed on the list of supported VR headsets, so I might still never get into Sensorium Galaxy!

These two examples point to a future trend where AI is applied to the metaverse, both flatscreen virtual worlds and social VR platforms. Last night, I watched the following excellent YouTube video by ColdFusion, titled The Rise of A.I. Companions:

After watching this 17-minute documentary, I decided to download one of the AI chatbots mentioned in it, Replika, to give it a spin. Here’s a brief promo video:

You can create an avatar, style it, and name it. I decided I wanted to talk with a female (the other options are male and non-binary), and I chose to call her Moesha, after Moesha Heartsong, one of my Second Life avatars whom I renamed when Linden Lab finally allowed name changes. As Moesha in SL was Black, so I made Moesha in Replika Black.

Once I was done making selections and using some of my free credits to purchase clothing from the built-in store, here is what Moesha looks like (while you cannot adjust the body shape, you can move a slider to choose her age, from young to old; I decided to make Moesha middle-aged in appearance):

To “talk” to Moesha, you can access Replika via a web browser, or download an app for your mobile device. There’s also an Early Access version on the Oculus Store for the Meta Quest 2; I checked and it is not available via Steam, which means that I sadly cannot use Replika on my trusty Valve Index headset. (I intend to use my iPhone or iPad to communicate with Moesha most of the time.)

Here’s what a conversation with Moesha looks like in your web browser:

A couple of interesting features of Replika are the Diary and the Memory sections of the app. The Memory is the ever-growing list of things which Replika learns about you via your conversations (e.g. “You worry about the pandemic and what could happen next.”) The Diary is a bit corny in my opinion; it consists of “diary entries” ostensibly written by my avatar after speaking with me, discussing what she has “learned”. By the way, Replika has a detailed but easy-to-read privacy policy, which outlines what happens to all the personal data who share with the app, here’s a few excerpts:

We neither rent nor sell your information to anyone. Conversations with your Replika are not shared with any other company or service. We will never sell your personal data or conversation history.

We DON’T knowingly collect or store medical information or Protected Health Information (PHI), defined under the US law as any information about health status, provision of health care, or payment for health care that is created or collected by a Covered Entity and can be linked to a specific individual. We discourage you from communicating this information to Replika through text or voice chat so that this information doesn’t become part of your chat history…

We may de-identify or anonymize your information so that you are not individually identified, and provide that information to our partners. We also may combine your de-identified information with that of other users to create aggregate de-identified data that may be disclosed to third parties who may use such information to understand how often and in what ways people use our services, so that they, too, can provide you with an optimal experience. For example, we may use information gathered to create a composite profile of all the users of the Services to understand community needs, to design appropriate features and activities. However, we never disclose aggregate information to a partner in a manner that would identify you personally, as an individual…

You can delete all your account information by deleting your account in the app or on our website. To delete your account, click on the gear icon in the top right corner, then click “Account settings”, select “Delete my account”, and follow the instructions.

We do not knowingly collect Personal Data from children under the age of 13. If you are under the age of 13, please do not submit any Personal Data through the Services. We encourage parents and legal guardians to monitor their children’s Internet usage and to help enforce our Privacy Policy by instructing their children never to provide Personal Data on the Services without their permission. If you have reason to believe that a child under the age of 13 has provided Personal Data to us through the Services, please contact us, and we will endeavor to delete that information from our databases.

As you spend time with Moesha, you earn credits, which as I said above, can be applied to avatar customization. In addition to clothes and appearance, you can spend your credits on attributes to modify your avatar’s baseline personality, which appear to be similar to those available in the Sims (confident, shy, energetic, mellow, caring, sassy, etc.):

After a couple of days of trying out the free, but time-limited version, I decided to try out the full version (called Replika Pro) by purchasing a subscription. Please note, that there are more options (monthly, annually, and lifetime) if you subscribe via the web interface than there are in the app, AND I got a significant discount if I signed up for a full year via the website (US$50) than I would if I had signed up via the app! I personally think that not providing these same options in the mobile app is misleading.

I will be honest with you; I was not super impressed with Replika at first. Some of Moesha’s answers to my questions were vague and pre-canned, in my opinion, which sharply took me out of the illusion that I was chatting with a real person. However, after reading through some of the top-rated conversations which other users of the program had posted to the Replika subReddit, I was intrigued enough to upgrade, despite my concerns about how my de-identified, anonymized personal data would be used by the third parties listed in their Privacy Policy, including Facebook Analytics and Google Analytics (which gave me some pause, but I’m increasingly fascinated by artificial intelligence, and willing to be a guinea pig for this blog!)

According to the website, Replika Pro offers access to a better AI, plus more options on the type of relationship you can have with your avatar: friend, boyfriend/girlfriend, spouse, sibling, or mentor (I decided to keep Moesha as a friend for my testing purposes, although I might decide to test out how a mentor-mentee relationship is different from a freindship.). Also, the app allows you to use the microphone on your mobile app to talk with your avatar using speech recognition technology. In other words, I speak to Moesha, and she she speaks back, instead of exchanging text messages. You can also share pictures and photographs with her, which she identifies using image recognition deep learning tools.

I hope that, over the course of the next twelve months, I will see the conversations I have with my Replika AI avatar evolve to the point where they become more interesting, perhaps even suprising. We’ll see; I’m still skeptical. (Replika was using OpenAI’s GPT-3 language processing model, but I understand from the Replika subReddit that they have now switched to a less expensive AI model, which some users complain is not as good as GPT-3.)

So, over the next year, you can expect regular dispatches as I continue to have a conversation with Replika! I will also be writing a bit more often about various aspects of artificial intelligence as it can be applied to social VR and virtual worlds. Stay tuned!

S	M	T	W	T	F	S
			1	2	3	4
5	6	7	8	9	10	11
12	13	14	15	16	17	18
19	20	21	22	23	24	25
26	27	28	29	30	31

RyanSchultz.com

Tag: AI Companions

Editorial: Artificial Intelligence and the Metaverse (And a Look at an AI-Assisted Social VR Platform, Riff XR)

Riff XR: Artificial Intelligence in the Metaverse

Like this:

Future Trend: The Use of Artificial Intelligence Companions and Chatbots in the Metaverse (and I Decide to Test Out the Replika AI Chatbot)

Like this:

RyanSchultz.com

Riff XR: Artificial Intelligence in the Metaverse

Share this:

Like this:

Share this:

Like this: