Tools
Fine-Tuning GPT-4o for Revolutionary Performance
Fine-tuning GPT-4o offers developers the ability to customise AI models for specific tasks, enhancing performance and accuracy.
Published
6 months agoon
By
AIinAsia
TL;DR:
- Fine-tuning GPT-4o allows developers to customise AI models for specific tasks, enhancing performance and accuracy.
- Cosine’s Genie achieved a state-of-the-art score of 43.8% on the SWE-bench Verified benchmark using fine-tuned GPT-4o.
- Distyl ranked 1st on the BIRD-SQL benchmark with a 71.83% execution accuracy using fine-tuned GPT-4o.
In the rapidly evolving world of artificial intelligence (AI), the ability to fine-tune models has become a game-changer. Today, we’re thrilled to announce the launch of fine-tuning for GPT-4o, a feature that developers have been eagerly awaiting. This new capability allows developers to customise GPT-4o models with their own datasets, leading to higher performance and lower costs for specific use cases. Let’s dive into what this means for the future of AI in Asia and beyond.
What is Fine-Tuning and Why Does It Matter?
Fine-tuning is the process of training a pre-trained AI model on a new dataset to adapt it to a specific task. For GPT-4o, this means developers can now tailor the model to their unique needs, whether it’s coding, creative writing, or any other domain-specific application. This customisation can significantly improve the model’s performance and accuracy, making it more efficient and cost-effective.
Getting Started with GPT-4o Fine-Tuning
To start fine-tuning GPT-4o, developers can visit the fine-tuning dashboard and select the base model they want to customise. GPT-4o fine-tuning is available to all developers on paid usage tiers, with costs starting at $25 per million tokens for training and $3.75 per million input tokens for inference.
For those looking to experiment without a significant investment, GPT-4o mini fine-tuning is also available. This version offers 2 million training tokens per day for free until September 23, making it an excellent starting point for developers to test the waters.
Achieving State-of-the-Art Performance
Over the past few months, we’ve collaborated with trusted partners to test fine-tuning on GPT-4o. The results have been impressive. Here are a couple of success stories:
Cosine’s Genie: A Software Engineering Marvel
Cosine’s Genie is an AI software engineering assistant that can autonomously identify and resolve bugs, build features, and refactor code. Powered by a fine-tuned GPT-4o model, Genie has achieved a state-of-the-art score of 43.8% on the new SWE-bench Verified benchmark. This is a significant improvement over previous models, demonstrating the power of fine-tuning.
“Genie is powered by a fine-tuned GPT-4o model trained on examples of real software engineers at work, enabling the model to learn to respond in a specific way.”
- Cosine
Distyl: Leading the Way in Text-to-SQL
Distyl, an AI solutions partner to Fortune 500 companies, recently placed 1st on the BIRD-SQL benchmark. Their fine-tuned GPT-4o model achieved an execution accuracy of 71.83%, excelling in tasks like query reformulation, intent classification, and SQL generation. This achievement highlights the versatility and effectiveness of fine-tuned models.
Ensuring Data Privacy and Safety
Fine-tuned models remain entirely under the control of the developers, ensuring full ownership of business data, including all inputs and outputs. This means your data is never shared or used to train other models. Additionally, we’ve implemented layered safety mitigations to prevent misuse of fine-tuned models. Automated safety evaluations and usage monitoring ensure that applications adhere to our usage policies.
Prompt: Customising GPT-4o for Your Needs
Before diving into fine-tuning, it’s crucial to understand the specific needs of your application. Here’s a prompt to help you get started:
“Imagine you are a developer working on a project that requires high accuracy in text-to-SQL conversion. How would you fine-tune GPT-4o to achieve the best results for this specific task?”
This prompt encourages you to think about the unique requirements of your project and how fine-tuning can help you achieve your goals. By customising GPT-4o, you can create a model that is tailored to your specific needs, leading to better performance and efficiency.
Comment and Share:
We’d love to hear your thoughts on fine-tuning GPT-4o and how it’s transforming the AI landscape. Share your experiences and insights in the comments below. Don’t forget to subscribe for updates on AI and AGI developments.
You may also like:
- Get Access to OpenAI’s New GPT-4o Now!
- 10 Amazing GPT-4o Use Cases
- 7 GPT-4o Prompts That Will Blow Your Mind!
- To learn more about fine tuning ChatGPT tap here.
Author
Discover more from AIinASIA
Subscribe to get the latest posts sent to your email.
You may like
Learning
Beginner’s Guide to Using Sora AI Video
This friendly guide covers features and tips to help you transform simple text prompts into visually stunning videos while using Sora AI.
Published
7 days agoon
February 14, 2025By
AIinAsia
Hello, lovely readers! If you’ve ever dreamt of creating lively, imaginative videos straight from simple text prompts, then Sora AI is about to become your new best friend. Developed by OpenAI, the Sora AI text-to-video generator lets you transform words into dynamic video content. But before you dive in, there are a few tricks of the trade that’ll help you get the most out of this cutting-edge tool.
Table of Contents
- What Is Sora AI?
- Getting Started with Sora AI
- Crafting Effective Prompts
- Advanced Features of Sora AI
- Limitations of Sora AI
- Key Differences: Original Sora vs. Sora Turbo
- Incorporating Personal Assets
- General Guidelines for All Prompts
- Category-Specific Tips
- Prompt Refinement Checklist
- Real-World Applications of Sora AI
- Conclusion & Next Steps
What Is Sora AI?
Picture this: a magical AI tool that can generate videos from simple text descriptions, courtesy of OpenAI. Much like text-to-image generators (e.g., DALL·E or MidJourney), Sora AI uses a diffusion model to take your prompt—something like “A cat playing the piano on a moonlit rooftop”—and transform it into a short video clip.
- Creative Storytelling: Sora excels at conjuring cinematic or whimsical visuals.
- Cinematic Effects: You can try out film noir, 3D animation, or even a painterly vibe.
- Animation of Still Images: Animate a static photo (say, your favourite landmark) and watch it come to life!
Do keep in mind that Sora does have its quirks:
- Human Imagery Restrictions: They’re quite cautious about privacy and ethics.
- Occasional Inconsistencies: Some videos end up looking a bit wonky—think odd proportions or peculiar motion.
Getting Started with Sora AI
1. Accessing Sora
- Head over to Sora’s official website and sign in with your OpenAI login.
- If you’re in a region where Sora’s not yet available, a VPN might come in handy.
- Choose your plan: free or paid subscription. Premium users enjoy higher resolutions and longer clips.
2. Familiarising Yourself with the Interface
- Prompting Window: The space where you type your imaginative descriptions.
- Storyboard: A timeline-like tool for building multi-scene videos.
- Blend Editor: Lets you merge and transition between multiple clips.
- Remix Tool: Tweak or reinterpret older videos with fresh prompts.
3. Setting Up Video Parameters
- Aspect Ratio: 16:9 for widescreen, 1:1 for social media squares—take your pick!
- Resolution: Going for 1080p uses more credits but looks crisp.
- Video Length: Some plans allow up to 60 seconds.
Crafting Effective Prompts
A brilliant video is only as good as the prompt you feed Sora. Here’s what works:
- Use Clear and Concise Language
- Avoid baffling jargon.
- Example: “A futuristic cityscape at night with glowing neon signs, flying cars, and a robotic figure on a rooftop.”
- Incorporate Visual Styles
- You can say “watercolour,” “stop motion,” “film noir”—Sora will adapt.
- Example: “A black-and-white film noir scene of a detective under a flickering streetlight in the pouring rain.”
- Add Camera Techniques
- Want slow motion or a panoramic sweep? Just mention it.
- Example: “A slow-motion close-up of a flower blooming in a sunny meadow.”
- Set the Mood
- Describe lighting, weather, and emotional vibes to guide the model.
- Example: “A cosy living room at dusk, with warm lighting and light rain tapping on the window.”
Advanced Features of Sora AI
Sora AI comes packed with a few extra goodies:
- Storyboard Tool
- Perfect for plotting a mini-film. Arrange scenes before generating.
- Blend Editor
- Seamlessly merge multiple video segments.
- Remix Existing Videos
- Revisit or alter older clips with new prompts or styles.
- Looping Content
- Create endless loops for social media or eye-catching GIFs.
- Image-to-Video Conversion
- Turn static images into snazzy animated clips.
- Video Extension
- Add extra frames to lengthen an existing clip.
- Text-, Image-, and Video-to-Video Inputs (Sora Turbo)
- More ways to feed Sora your creative ideas.
- Remix, Re-Cut, and Blend Tools
- Remix: Swap or update elements in an already-generated clip.
- Re-Cut: Fine-tune specific parts of the video.
- Blend: Melt different objects or scenes together for unique transitions.
Limitations of Sora AI
Like any AI tool, Sora isn’t perfect. Here are the main caveats:
- Physical Accuracy
- Expect the occasional floating chair or bizarre object movement.
- Continuity and Object Permanence
- Longer sequences can sometimes have items popping in and out randomly.
- Video Duration Caps
- Even if you pay for Pro, you might be limited to under 60 seconds.
- Resolution Constraints
- 1080p is your limit for now.
- Performance and Queue Times
- At peak hours, you may find yourself twiddling your thumbs while Sora processes.
- Ethical & Moderation Limits
- You’ll be stopped if you try to generate something too controversial or featuring humans.
- Lack of Fine-Grained Control
- Beyond your text prompt, micromanaging details is tricky.
Key Differences: Original Sora vs. Sora Turbo
There’s the original Sora and the shiny upgraded version, Sora Turbo. Here’s a quick rundown:
- Speed and Efficiency
- Sora Turbo is a proper sprinter, generating multiple clips at once.
- Video Quality and Duration
- Original Sora could handle up to 1 minute, whereas Turbo caps each clip at about 20 seconds (though you can merge them later).
- New Features and Customisation
- Tools like Remix, Re-Cut, Loop, and a fancier Storyboard.
- Input Methods
- Turbo accepts text, images, and even video-to-video prompts.
- Accessibility
- Both are available if you’re on OpenAI Plus or Pro, but usage limits differ.
Incorporating Personal Assets
Want to insert your own pictures or short clips? No problem:
- Media Upload
- Upload images or mini videos by clicking the “+” or “Upload” button.
- Customisation
- Blend your media with AI-generated visuals, or add transitions and visual effects.
- Privacy Settings
- If you don’t fancy sharing your personal content, just disable “Publish to Explore”.
General Guidelines for All Prompts
- Brevity: Keep it under 120 words—short and sweet!
- Specificity: Focus on one or two main ideas.
- Imagery: Paint a clear mental picture for the AI.
- Avoid Sensitive Content: Don’t poke the moderation bear.
- Build Complexity Slowly: If you want something intricate, iterate step by step.
Category-Specific Tips
- Sequence Prompts
- Works: Clear transitions or progressions (e.g., “A knight travelling across a desert, discovering a hidden oasis”).
- Doesn’t Work: Muddled, overly abstract sequences.
- Example: “An epic duel between a Balrog and a Paladin Platypus in a desert world.”
- Human-Focused Prompts
- Works: Humorous or relatable actions (e.g., “A mime crossing a marathon finish line”).
- Doesn’t Work: Anything too philosophical or jam-packed with details.
- Example: “A man strolling through a snowstorm, wearing a helmet made of raw meat.”
- Animal-Focused Prompts
- Works: Fun, vibrant scenarios (e.g., “Cats dressed as wizards facing camera and casting spells”).
- Doesn’t Work: Animals performing too many abstract or contradictory actions at once.
- Example: “A sabre-toothed tiger padding along a glowing riverbank in a prehistoric forest.”
- Figure-Focused Prompts
- Works: Distinctive, stylised scenes (e.g., “A weathered robot scavenging in an abandoned city”).
- Doesn’t Work: Mashing too many cultural icons into a single prompt.
- Example: “A superhero cameo reminiscent of anime, delivering a massive punch that shakes the earth.”
- Location-Focused Prompts
- Works: Captivating environmental descriptions (e.g., “Drone footage of ancient tribes on a mountain at sunset”).
- Doesn’t Work: Overdoing the details so that the setting becomes cluttered.
- Example: “A neon-drenched cityscape welcoming the year 2078, fireworks included.”
Prompt Refinement Checklist
- Clarity: Is your description straightforward and easy to follow?
- Engagement: Does your prompt conjure a strong mental image or storyline?
- Focus: Avoid cramming 10 different big ideas into one prompt.
- Tone: Pick a vibe—playful, cinematic, dramatic—and stick to it.
- Content Sensitivity: Steer clear of copyrighted figures or explicit subject matter.
Real-World Applications of Sora AI
- Social Media: Short, snappy clips for Instagram, TikTok, or YouTube Shorts.
- Storytelling: Quick teasers or imaginative sketches for your next big idea.
- Education: Bring tutorials or lessons to life with short explainers.
- Marketing: Spice up ad campaigns with unique, AI-generated flair.
Conclusion & Next Steps
All in all, Sora AI is a splendid tool for spinning text into visual gold—especially if you love creative, short-form storytelling. It’s not flawless, mind you: longer or more complex prompts can trip it up. But as a starting point for playful, cinematic, or downright quirky videos, it’s in a league of its own.
For a more professional setting—like detailed brand adverts or longer educational videos—Sora might need a bit more polish to handle intricacy. Still, it’s well worth a try if you’re eager to push the boundaries of AI-generated content.
Happy creating, folks!
Disclaimer: This guide blends community wisdom and publicly available resources. Use at your own discretion, and have fun exploring the wild world of Sora AI!
You may also like:
- What Is Sora AI?
- Kling: A Chinese AI Video Model Outshining Sora
- You can also access Sora AI by tapping here (paid for service, not available in all countries yet)
Author
Discover more from AIinASIA
Subscribe to get the latest posts sent to your email.
Tools
Google’s New AI Search Mode—An Early Sneak Peek
Discover how Google’s new AI Mode, tested internally, aims to revolutionise search results for more open-ended queries. Learn about its features, powered by Gemini 2.0, and how it might change your online search experience.
Published
1 week agoon
February 13, 2025By
AIinAsia
TL;DR – What You Need to Know in 30 Seconds
- Google is internally testing a new “AI Mode” in Search, powered by a custom version of Gemini 2.0.
- This new mode is aimed at more open-ended, exploratory queries like advice and product comparisons.
- The user interface is still in its early stages but features a conversational layout with AI-generated answers.
- Currently only tested by Google employees in the US, with a possible launch as early as this year.
- It points towards a more interactive, voice-friendly Search experience for users worldwide.
Google’s New AI Search Mode—What Do We Know Already?
Today, we’ve got a fascinating look at Google’s latest internal experiment with Search: a brand-new “AI Mode.” Don’t worry, it’s not a corny sci-fi flick—it’s Google’s first big step towards taking its already impressive Search capabilities to new heights.
If you’ve been following Google’s AI journey, you’ll know it’s been experimenting like mad lately. The search giant is clearly keen to outdo itself, especially given the fierce competition in the AI space. Let’s dive into this new “AI Mode,” see what it’s all about, and have a good old natter about why it might just change the way we find and digest information.
A Brief Overview
- What is AI Mode?
It’s a newly tested feature in Google Search, powered by a custom version of Gemini 2.0, designed to handle open-ended and exploratory queries more effectively. Rather than giving you a list of links, it provides breakdowns of information plus prompts to dig deeper. - Why now?
It’s Google’s push to see if there’s a demand for a more conversational, AI-driven approach. Traditional Search does the job fine for most queries, but those bigger “advice” or “comparison” style questions haven’t always been served as well. - What’s in it for you?
You can ask something like, “How many boxes of pasta do I need for a family party?” or “Which jacket material is best for cold, rainy winters in Asia?” and you’ll get an overview that’s more helpful than a standard list of links.
Powered by Gemini 2.0
We’ve heard loads of chatter about Google’s Gemini recently. It’s a powerful large language model that’s meant to be a big leap forward from previous AI systems. This “custom version” in AI Mode apparently brings advanced reasoning and thinking capabilities directly into your search bar. While that might sound like jargon to some, the bottom line is it should be more accurate and context-aware when dealing with follow-up questions.
An Early Look at the Interface
At the moment, a small group of Google employees in the US are testing this new interface. It’s a bit of a hybrid between the standard Search page and a chatbot. You’ll still see the query at the top, but instead of 10 blue links, you get a more elaborate AI-driven response. Meanwhile, there’s a card on the right (or somewhere in that ballpark) with links if you want to dive deeper.
Although the design is in its early stages, it’s clearly part of Google’s broader plan to offer a more interactive and informative search experience. And for all the mobile fans out there, yes—it’s designed to work on your phone, with voice input on Android and iOS. Handy if you’re on the go!
Why This Matters
So, why should you care? Whether you’re a business owner in Singapore researching the best marketing strategies, a student in Seoul comparing study-abroad programmes, or simply a curious netizen in Bangkok looking for aquascaping supplies, the new AI Mode could make your life a lot easier. Instead of sifting through link after link, you can get structured info right away, plus relevant links all in one place. It’s a bit like having your own dedicated research assistant.
When Can We Try Google’s Exploratory Search?
Well, at the moment, only Googlers in the US have access. Google CEO Sundar Pichai hinted that 2025 will be “one of the biggest years for Search innovation yet,” so we can’t guarantee an exact date, but it’s possible we’ll see a broader roll-out sometime this year. Keep an eye on AIinASIA for further updates, as we’ll keep you posted the second we get word.
Final Thoughts on Open-ended Queries
From classic web-search to voice-search and now AI-driven results, Google is clearly on a mission to streamline how we discover and digest information. This new AI Mode, powered by Gemini 2.0, appears to be a significant step forward in making Search even more conversational and context-aware.
For those of us in Asia, this can’t come soon enough. Imagine exploring local cuisines, planning trips, or learning new skills with AI Mode’s helpful breakdowns at your fingertips. The future of Search looks just a little bit brighter—and a lot more AI-driven.
That’s all for now, folks. Thanks for stopping by AIinASIA! As always, don’t forget to drop your thoughts below. We’d love to hear if you’re excited for Google’s new AI Mode, or if you’re a bit sceptical about AI chatbots changing the way we search.
Stay updated with AIinASIA for the latest on Google’s AI developments and everything else that’s hot in the world of artificial intelligence. If you liked this piece, do give it a share—and if you didn’t, share it anyway to let your mates know what’s coming next in the AI realm!
You may also like:
- ChatGPT Voice Mode: The Future of AI Interaction is Here!
- Google’s AI Overviews: A New Era of Information Theft?
- Try Google Gemini for free by tapping here.
Author
Discover more from AIinASIA
Subscribe to get the latest posts sent to your email.
Life
ChatGPT’s New Custom Traits: What It Means for Personalised AI Interaction
ChatGPT’s new trait assignment feature lets users personalise their chats by adjusting tone and style from Chatty to Gen Z, or Analytical.
Published
2 weeks agoon
February 6, 2025By
AIinAsia
TL;DR – What You Need to Know in 30 Seconds
- OpenAI has introduced a new trait assignment feature for ChatGPT, allowing users to personalise its tone and style.
- Users can specify a nickname, profession, and personality traits like “Chatty,” “Encouraging,” or “Gen Z.”
- This feature does not affect ChatGPT’s memory, which separately retains or forgets past interactions.
- The update is currently rolling out on ChatGPT.com and Windows, with mobile and MacOS versions coming soon.
- While this enhances customisation, it still relies on prompt engineering, meaning it doesn’t deeply change how the AI works.
- OpenAI moderates trait usage to prevent misuse and ensure compliance with its terms of service.
OpenAI’s New ChatGPT Update: Custom Traits for a Personalised Experience
OpenAI has launched a new customisation feature for ChatGPT, giving users the ability to assign personality traits to the AI. This update aims to enhance how users interact with ChatGPT, making conversations feel more natural and tailored to individual preferences.
Instead of a one-size-fits-all AI assistant, users can now adjust ChatGPT’s tone, engagement style, and personality to better match their needs—whether for professional tasks, casual conversation, or content creation.
How It Works: Assigning Traits to ChatGPT
Users now have greater control over ChatGPT’s persona by specifying:
- Preferred Name/Nickname – ChatGPT will refer to users by their chosen name.
- Profession – Users can provide their job title or field for more relevant responses.
- Traits – Users can choose from a variety of styles, such as:
- Chatty – More conversational and engaging.
- Encouraging – Supportive and motivational.
- Gen Z – A more informal, youthful style.
- Skeptical – More critical and questioning in responses.
- Analytical – Ideal for professional or logical discussions.
- Creative – A better fit for brainstorming and ideation.
- Concise – Focused on summarised, to-the-point replies.
- Empathetic – Suitable for more sensitive topics.
This feature is separate from ChatGPT’s memory, meaning it doesn’t remember details across different conversations but instead modifies responses in real time based on user input.
Is This a Deep AI Upgrade or a Simple UI Change?
Despite the buzz, this is not a fundamental change to ChatGPT’s underlying model. The feature relies on prompt engineering, meaning it adjusts responses rather than truly altering how the AI thinks or operates.
In essence, it’s a user-friendly way to tweak ChatGPT’s style using preset prompts rather than requiring users to manually provide detailed instructions.
Moderation Considerations
To prevent misuse or inappropriate modifications, OpenAI moderates the customisation options to ensure they align with its terms of service. While users can tailor ChatGPT’s personality, it won’t allow for harmful or misleading persona adjustments.
What’s Next? Expansion to More Platforms
The trait assignment feature is currently rolling out to:
- ChatGPT.com
- Windows desktop app
Coming soon: The update will be available on MacOS and mobile apps in the next few weeks.
However, users in the EU, Norway, Iceland, Liechtenstein, and Switzerland won’t have immediate access due to regulatory considerations.
Potential Impacts: A Step Towards More Personalised AI?
This update marks a shift towards making AI interactions feel more human-like and relatable. It’s particularly useful for:
- Businesses and professionals – More tailored, industry-specific AI interactions.
- Casual users – A more engaging and fun conversation experience.
- Content creators – An AI that aligns better with their preferred tone and style.
That said, it also raises questions about how personality traits might impact AI reliability, especially when it comes to factual accuracy and biases in different roles.
Final Thoughts
OpenAI’s new trait assignment feature is a welcome addition for users looking for a more personalised ChatGPT experience. While it doesn’t represent a deep technical shift, it streamlines customisation and could pave the way for even more user control over AI interactions in the future.
For now, the biggest takeaway is that ChatGPT can sound more like the assistant you want it to be—whether that’s chatty, analytical, or even a little bit Gen Z.
Let’s Talk AI!
How are you preparing for the AI-driven future? What questions are you training yourself to ask? Drop your thoughts in the comments, share this with your network, and subscribe for more deep dives into AI’s impact on work, life, and everything in between.
You may also like:
- Customising AI: Train ChatGPT to Write in Your Unique Voice
- The AI Age is Here—But Can You Ask the Right Questions?
- Or try the free version of ChatGPT by tapping here.
Author
Discover more from AIinASIA
Subscribe to get the latest posts sent to your email.

10 Prompts to Build Strong Vendor Relationships with ChatGPT

AI Notetakers in Meetings—Innovation, Invasion, or Something in Between?

10 Prompts to Transition into a New Role with ChatGPT
Trending
-
Tools3 weeks ago
Perplexity Assistant: The New AI Contender Taking on ChatGPT and Gemini
-
Business3 weeks ago
Microsoft 365 Copilot Chat: AI Productivity Without the Subscription
-
Life2 weeks ago
ChatGPT’s New Custom Traits: What It Means for Personalised AI Interaction
-
Learning3 weeks ago
AI in Asia for Beginners: A 2025 Guide to Getting Started
-
Learning7 days ago
Beginner’s Guide to Using Sora AI Video
-
News2 weeks ago
DeepSeek in Singapore: AI Miracle or Security Minefield?
-
News2 weeks ago
Meta AI’s Strategic Leap: Expansion into MENA
-
Tech2 weeks ago
Grok AI Goes Free: Can It Compete With ChatGPT and Gemini?