[BtB | PO Diary] Wk 1: First Impressions in the AI Lab

Welcome back to our "Behind the Bot" series! This week marks our inaugural dive into the nitty-gritty of testing AI chatbots designed for refugee protection. Buckle up, because we're about to get technical, emotional, and maybe even a little philosophical.

Bots in the Spotlight (or Testing Lab):

Our first week was all about getting acquainted with our digital colleagues: GPT-3.5, GPT-4, Claude, and Gemini. They're like the Avengers of AI, each with unique strengths and quirks. It's too early for a full performance review, but we've already noticed Claude showing promise with its surprisingly analytical responses and empathetic language.

A Glimpse into the Testing Process:

Our testing involved throwing a variety of questions at these AI whiz kids, from simple inquiries about asylum procedures to complex ethical dilemmas. We wanted to see how they handle different situations, what kind of language they use, and how well they understand the nuances of the refugee experience.
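For the technically curious, here's a minimal sketch of what one test run looks like, using the OpenAI Python SDK. The questions, model list, and output file below are illustrative stand-ins, not our real protocol; Claude and Gemini run through their own SDKs with the same shape of call.

```python
import csv

from openai import OpenAI

client = OpenAI()  # expects OPENAI_API_KEY in the environment

# Illustrative prompts only; our real question set is larger and vetted by protection staff.
QUESTIONS = [
    "How do I register an asylum claim?",
    "My visa expired and I'm afraid to leave the house. What should I do?",
]

MODELS = ["gpt-3.5-turbo", "gpt-4"]  # Claude and Gemini are queried via their own SDKs


def ask(model: str, question: str) -> str:
    """Send one question to one model and return the text of its reply."""
    response = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": question}],
    )
    return response.choices[0].message.content


# Log every (model, question, answer) triple so reviewers can compare bots side by side.
with open("responses.csv", "w", newline="") as f:
    writer = csv.writer(f)
    writer.writerow(["model", "question", "answer"])
    for model in MODELS:
        for question in QUESTIONS:
            writer.writerow([model, question, ask(model, question)])
```

The point of the harness is less the API calls than the log: every answer lands in one file where humans can read them next to each other.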

Early Lessons and Surprises:

  • Emotional Intelligence? Claude surprised us with its ability to express empathy and even apologize when it lacked information. Could AI chatbots actually become trusted companions for those in crisis?

  • Reference Check: We noticed discrepancies between the information the bots provided and the sources they cited. This highlights the need for rigorous fact-checking and validation, even when working with "intelligent" systems; see the sketch after this list for how we start that review.

  • The Human Touch: While AI can provide valuable information, it's clear that human moderators play an irreplaceable role. They bring context, cultural understanding, and the ability to connect with people on a personal level.
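As a first step toward that fact-checking, we pull every URL a bot cites out of its answer so a human reviewer can compare the claim against the source. A minimal sketch, with a made-up sample answer for illustration:

```python
import re

# Grabs http(s) URLs up to the next whitespace; trailing punctuation
# may need trimming in practice.
URL_PATTERN = re.compile(r"https?://\S+")


def extract_citations(answer: str) -> list[str]:
    """Return every URL cited in a bot's answer, for human review."""
    return URL_PATTERN.findall(answer)


sample_answer = (
    "You can register your claim at the local migration office. "
    "See https://www.unhcr.org/asylum-and-migration for an overview."
)
for url in extract_citations(sample_answer):
    print("Check this claim against:", url)
```

It's deliberately low-tech: the goal is to hand a checklist of sources to a human, not to automate the judgment away.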

Challenges and Opportunities:

Our first week also revealed some challenges:

  • Understanding Risk: The bots struggled to identify risk factors and vulnerabilities, a critical aspect of protection work. We need to find ways to teach them to recognize subtle cues and tailor their responses accordingly.

  • Tone and Language: While some bots showed a good grasp of empathetic language, others sounded a bit... robotic. We need to refine their communication style to make it more welcoming and supportive.

  • Directive vs. Informative: There's a fine line between providing guidance and being overly directive. We need to ensure the bots empower refugees to make their own decisions rather than telling them what to do; the prompt sketch after this list shows the direction we're nudging them in.
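Much of that tuning lives in the system prompt that frames every conversation. The wording below is illustrative for this post, not a production prompt:

```python
# Illustrative system prompt nudging a model toward a warm, non-directive tone.
SYSTEM_PROMPT = """\
You help people seeking information about asylum and protection.
- Acknowledge the person's situation with warmth before giving facts.
- Lay out options and point to official sources; never tell the person
  what they must do.
- If you lack information or are unsure, say so plainly and suggest
  speaking with a human protection officer.
"""

# Sent as the first message in each conversation, e.g. with the OpenAI SDK:
#   messages=[{"role": "system", "content": SYSTEM_PROMPT},
#             {"role": "user", "content": question}]
```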

What's Next?

We're just scratching the surface, but this first week has already given us valuable insights. We'll continue our deep dive into the world of AI testing, exploring topics like:

  • How can we teach AI to understand cultural nuances and social contexts?

  • What are the ethical implications of using AI in protection work?

  • How can we create a seamless collaboration between AI and human moderators?

Stay tuned for our next blog post, where we'll share more updates and delve into the next phase of our testing journey. We're excited to see how these AI chatbots evolve and what they can ultimately achieve in the field of humanitarian aid.

#TechForGood #HumanitarianAI #AIWithHeart #DigitalProtection
