Just a few days ago, OpenAI released its new model, GPT-4o, for free.
This new model is setting a new milestone. It can engage in fluent conversations, generate images, serve as an excellent tutor, offer new opportunities for people with visual impairments, and more. All of this with a response quality that will amaze you.
Here are 10 ways you can use ChatGPT-4o to make it your favorite personal assistant.
1. GPT-4o’s voice and camera features takes talking with AI to the next level
Talking with GPT-4o feels more human. It listens to you, gives responses smoothly and it can even see what’s surrounding you. You can also modulate the tone of its voice, offering more options to simulate different scenarios.
Another cool thing is that you can add another person to the conversation (more about this in the next use cases below) and even another AI! Let's see how Greg Brockman interacts with GPT-4o, and then adds another AI to the conversation seamlessly.
This tool can assist with personal interviews, job interviews, learning a new language, and more (GPT-4o now supports translation in over 20 languages).
2. AI that understands context
Most AIs on the market aim to provide responses as a human would, meaning they need to understand context on a deeper level. In this regard, GPT-4o is incredible.
In the following example, we can see how GPT-4o understands the context of a job interview and knows the parameters we need to meet regarding our personal presentation. But here's the impressive part—not only does it understand the context, but when generating a response, it pays attention to every detail in the conversation and offers a genuine reply, just like any of us would.
3. GPT-4o as a meeting facilitator
Often, we attend work meetings with a set agenda, but it's common to run over time and spend more than we planned. Now we can use GPT-4o as a meeting facilitator. It can keep track of our agenda, understand the context, guide the conversation effectively, and provide a summary of the meeting.
Additionally, we can instruct it to take notes on specific agreements made during our meeting.
4. GPT-4o can become your math tutor
We can now use GPT-4o as our personal tutor. Thanks to its multimodal capabilities, it can guide us in detail through various subjects of interest.
This has a valuable impact on education, as GPT-4o introduces AI as a new tool for acquiring knowledge, in contrast to traditional methods.
Here’s how an X user used ChatGPT to solve math problems.
GPT-4o explains each step clearly and finds the best way for us to understand it.
5. GPT-4o can provide customer service
With GPT-4o, you can manage various real-time tasks focused on customer service. In the video below, you’ll see how one GPT-4o provides customer service while the other calls on behalf of a customer.
The next interesting step would be to integrate GPT-4o with different applications. This would take its capabilities as a personal assistant to a whole new level.
6. AI assistance for vision impairment
When it comes to social inclusion, GPT-4o has truly excelled. You can now use its vision mode to interact directly with real-world environments. This will help people with vision impairments by providing a tool that guides them in a unique way. It describes their surroundings and answers questions in real-time.
Here’s an example that really impressed me.
7. Text extraction from images
Now you can easily extract text from PDF files using GPT-4o. Just upload the PDF file you want to use and provide a prompt like the one below.
Extract this recipe to [choose the file type]
Here’s an example.
8. Get insights from datasets
If we want to get analysis and insights through visualizations, we can do it with GPT-4o. Simply upload a dataset or spreadsheet and enter the following prompt:
Analyze this spreadsheet. Do deep technical and statistical analysis on this. Generate charts and visualizations
Let’s see how an X user analyzed the sales of a shoe company.
9. Coding
When it comes to coding, the precision and speed of the responses have been improved, allowing us to get lines of code in a matter of seconds.
Let's look at an example where Sawyer Hood recreated a chat similar to a messenger app in HTML.
Here’s another example of how a small video game was built.
It’s worth noting that GPT-4o can perform code testing and integrate with code editors. Here’s an example.
10. Image generation
GPT-4o can help you create images with a prompt or from another image you upload (it still has a way long to go, though)
Let’s see what we can get by using the following prompt.
A friendly-looking robot wearing a baseball cap standing in an upright pose facing the camera. it has a smile on its face.
Here’s the image I got.
Here’s a caricature an X user got after uploading one of his pictures.
That’s it! If you found another cool use case, share it in the comment section!
I've used GPT-4o for a month and here are my impressions:
1. The accuracy of generated content has improved significantly, by about 40%, especially in translation.
2. It has updated the multimodal framework based on GPT-4.
3. The ability to analyze uploaded file data is particularly excellent.
4. The programming code capabilities remain outstanding, provided the requirements are clearly described.