(more…) The post Parent Center Data Collection FAQs appeared first on Center for Parent Information and Resources.| Center for Parent Information and Resources
A 15-year study finds that veg*nism is rising among women, but not among men, with different motivations driving each group. The post Gender Gap Widens In Veg*n Diet Trends appeared first on Faunalytics.| Faunalytics
When people reduce their consumption of large-bodied animals but consume more small-bodied animal products, animal suffering is increased, as more animal lives are impacted. Faunalytics and Bryant Research conducted a meta-analysis of the small body problem to evaluate its prevalence and offer recommendations for advocates. The post Quantifying The Small Body Problem: A Meta-Analysis Of Animal Product Reduction Interventions appeared first on Faunalytics.| Faunalytics
Although the country’s cage-free egg farmers report a willingness to expand their operations, their underlying reasons vary. The post Japanese Egg Farmers Have Different Motivations To Go Cage-Free appeared first on Faunalytics.| Faunalytics
For consumers to choose plant-based dairy alternatives over animal-based dairy products, they usually need the capability, opportunity, and motivation to do so. This study explores how these factors play a role in making the switch. The post What Motivates Consumers To Choose Dairy Alternatives? appeared first on Faunalytics.| Faunalytics
Can tortoises be optimistic? For the first time, researchers use cognitive bias tests on reptiles to explore whether mood influences their behavior. The post More Than A Feeling: Evidence That Reptiles May Experience Moods appeared first on Faunalytics.| Faunalytics
Dog guardianship is increasing in Thailand. What does this mean for humane care and population management practices? The post Understanding Dog Guardianship In Greater Bangkok appeared first on Faunalytics.| Faunalytics
In China’s booming aquaculture industry, fish welfare is swimming under the radar. This study uncovers live transport conditions and stakeholders’ views.| Faunalytics
I have a large number of replacement rules where the right hand side (RHS) is always the same. To avoid repetition, I separated the two sides to add the same RHS to all of them. This is a minimal e...| Mathematica Stack Exchange
By Florian Keil, Melina Stein and Flurina Schneider. Is artificial intelligence, a technology aggressively advertised as the ultimate cure-all, fundamentally incompatible with transdisciplinarity and its decades-old insight that the “wicked” problems of the real world do not lend themselves to one-dimensional solutions? Should transdisciplinary research outright reject a technology that is already undermining efforts to ... Read more| Integration and Implementation Insights
Our discussion of How to Assess Artificial Intelligence Impacts in International Development? at the recent Technology Salon DC had its very premise questioned at the start. We should not be asking how we can assess AI when international development organizations decide to adopt the new technology. We must start assessing AI now. Why? Because we... The post We Must Assess Artificial Intelligence Impact Today in International Development appeared first on Technology Salon.| Technology Salon
When it comes to the exotic pet trade, news stories emphasize animal welfare, while peer-reviewed research addresses it from multiple angles — conservation in particular.| Faunalytics
Researchers critique the scientific literature on insect farming, revealing flawed assumptions about sustainability, feed sources, and economic viability.| Faunalytics
This study examines how foster environments and socialization practices influence kitten fear responses, providing essential knowledge for improving adoption success and long-term feline welfare.| Faunalytics
Kürzlich las ich in der „Süddeutschen Zeitung“ vom 21./22.6.25 einen Beitrag von Christian Weber mit dem Titel Mit Anstand leiden, in dem es um das Thema „Psychotherapie-Erfolg“ ging. Der Beitrag thematisiert die Grenzen moderner Psychotherapie und Psychopharmaka im Umgang mit psychischen Erkrankungen wie Depressionen.…| Joachim Funke
Im Rahmen einer Evaluation wurden die Einführung von Tempo 30 in einem großen Teil des Amsterdamer Hauptstraßennetzes untersucht. Betrachtet wurden unter anderem die Auswirkungen auf die…| Zukunft Mobilität
As systems change evaluators, we’ve had firsthand experience supporting philanthropy and foundation strategies and portfolio programs aimed at addressing wicked problems — from safe, affordable housing to better health outcomes for vulnerable groups. We’ve learned that more traditional evaluation approaches often fall short when tackling the dynamic and emergent nature of complex systems change. In […]| Unstuck by UNDP
The welfare needs of Japanese quails are understudied compared to other farmed birds. What do we know, and what more can we learn?| Faunalytics
Focus groups with Singapore families reveal that children are more curious about alternative proteins than their parents, who worry about naturalness.| Faunalytics
Millions of companion animals in the U.S. aren’t receiving the care they need — and the consequences can be devastating.| Faunalytics
Despite major public investment in Brazil-based JBS, the world’s largest meat producer, poverty and hunger are rising in the cities where the company operates.| Faunalytics
How can you eat fewer animal products — and stick to it? Start with a challenge, and bring your friends.| Faunalytics
We’re pleased to present the results of our annual community survey — including what you think we’re doing right, what we can do better, and plans for the future.| Faunalytics
Can large language models harm animals? The novel Animal Harm Assessment uncovers biases and blind spots in how these models talk about animals.| Faunalytics
We have worked together with Nifty Sustainability CIC on an in depth look at what Community Savers affiliates and partnerships are achieving through our community action with a focus on 2024. Below is an abridged version of sections from Nifty’s excellent independent evaluation report – its a great read! Download our 2024 Impact Evaluation here […] The post Savings, Spaces, and Solidarity: Community Savers in 2024 appeared first on Community Savers.| Community Savers
I previously tried (and failed) to setup LLM tracing for hinbox using Arize Phoenix and litellm. Since this is sort of a priority for being able to follow along with the Hamel / Shreya evals course with my practical application, I’ll take another stab using a tool with which I’m familiar: Braintrust. Let’s start simple and then if it works the way we want we can set things up for hinbox as well. Simple Braintrust tracing with litellm callbacks Callbacks are listed in the litellm docs as...| Alex Strick van Linschoten
It’s important to instrument your AI applications! I hope this can more or less be taken as given just as you’d expect a non-AI-infused app to capture logs. When you’re evaluating your LLM-powered system, you need to have capture the inputs and outputs both at an end-to-end level in terms of the way the user experiences things as well as with more fine-grained granularity for all the internal workings. My goal with this blog is to first demonstrate how Phoenix and litellm can work toget...| Alex Strick van Linschoten
I’ve been working on a project called hinbox - a flexible entity extraction system designed to help historians and researchers build structured knowledge databases from collections of primary source documents. At its core, hinbox processes historical documents, academic papers, books and news articles to automatically extract and organize information about people, organizations, locations, and events. The tool works by ingesting batches of documents and intelligently identifying entities ac...| Alex Strick van Linschoten
Group leaders nowadays bear a huge individual responsibility for raising money. But what if it was departments, and not group leaders, that were the unit of selection?| Total Internal Reflection
I came across this quote in a happy coincidence after attending the second session of the evals course: It’s obviously a bit abstract, but I thought it was a nice oblique reflection on the topic being discussed. Both the main session and the office hours were mostly focused on the first part of the analyse-measure-improve loop that was introduced earlier in the week. Focus on the ‘analyse’ part of the LLM application improvement loop It was a very practical session in which we even took...| Alex Strick van Linschoten
Key insights from the first session of the Hamel/Shreya AI Evals course, focusing on a 'three gulfs' mental model (specification, generalisation, and comprehension) for LLM application development…| mlops.systems
Investigating an audience-driven strategy for interactive interpretation prototyping in a science museum setting at the discovery phase of exhibition development. The post Prototyping the Power Hall: a decision-making process appeared first on Science Museum Group Journal.| Articles Archive - Science Museum Group Journal
Suppose you wanted better ratings of intelligence reports. Where could that improvement come from? How could you possibly get it? Some years ago the U.S. Office of the Director of National Intelligence proposed an answer of sorts. It was: use the method described in their Rating Scale document. In previous posts, I described various issues […]| Tim van Gelder
Let's check the Intune filter evaluation report options available in the Endpoint Manager (a.k.a. Intune) portal. The evaluation options can help you| How to Manage Devices Community Blog Modern Device Management Guides
A few weeks ago, I held a guest lecture at University of North Carolina Charlotte on how we can use large language models for annotation in the context of argument mining and fact verification. Here are the contents of that lecture in blog post format.| Lj Miranda
For a research project I am currently evaluating all kinds of generative AI models (mostly for visual artifacts but some text based ones as well). There also is somewhat of a push at my employer to use those systems more because of “efficiency”. So we all know that LLMs fabricate facts, meaning: They produce text […]| Smashing Frames
If I could give a message to high school seniors and their parents, I would tell them that a college choice (or a non-college choice) is not a report card on 13 years of education or 18 years of parenting. There is no final judgment. As with every stage of life, the key is finding...Read More| chinese grandma
Introduction I’ve been sitting on this blog for a while. But it’s been kicked out of the drafts, by events of an early morning research exploration, as I continue writing the book of the GM Moving journey and learning so far. It’s about the common recurring topic of line of sight, by which I mean […]| Hayley Lever
This post explores problems contributing to a benchmark crisis in LLM evaluation and potential solutions.| ruder.io
Chapter 10 of Chip Huyen’s “AI Engineering,” focuses on two fundamental aspects: architectural patterns in AI engineering and methods for gathering and using user feedback. The chapter presents a progressive architectural framework that evolves from simple API calls to complex agent-based systems, while also diving deep into the crucial aspect of user feedback collection and analysis. 1. Progressive Architecture Patterns The evolution of AI engineering architecture typically follows a p...| Alex Strick van Linschoten
This chapter was all about RAG and agents. It’s only 50 pages, so clearly there’s only so much of the details she can get into, but it was pretty good nonetheless and there were a few things in here I’d never really read. Also Chip does a good job bringing the RAG story into the story about agents, particularly in terms of how she defines agents. (Note that the second half of this chapter, on agents, is available on Chip’s blog as a free excerpt!) As always, what follows is just my no...| Alex Strick van Linschoten
This chapter represents a crucial bridge between academic research and production engineering practice in AI system evaluation. What sets it apart is the Chip’s very balanced perspective - neither succumbing to the prevalent hype in the field nor becoming overly academic. Instead, she melds together practical insights with theoretical foundations, creating a useful framework for evaluation that acknowledges both technical and ethical considerations. Introduction and Context Key Insight: The...| Alex Strick van Linschoten
Really enjoyed this chapter. My tidied notes from my readings follow below. 150 pages in and we’re starting to get to the good stuff :) Overview and Context This chapter serves as the first of two chapters (Chapters 3 and 4) dealing with evaluation in AI Engineering. While Chapter 4 will delve into evaluation within systems, Chapter 3 addresses the fundamental question of how to evaluate open-ended responses from foundation models and LLMs at a high level. The importance of evaluation canno...| Alex Strick van Linschoten
Here are the final notes from ‘Prompt Engineering for LLMs’, a book I’ve been reading over the past few days (and enjoying!). Chapter 10: Evaluating LLM Applications The chapter begins with an interesting anecdote about GitHub Copilot - the first code written in their repository was the evaluation harness, highlighting the importance of testing in LLM applications. The authors, who worked on the project from its inception, emphasise this as a best practice. Evaluation Framework When eva...| Alex Strick van Linschoten
Editor’s Note: This article was originally published by the College Fix on December 13, 2024. With edits to match Minding the Campus’s style guidelines, it is crossposted here with permission. A University of Illinois student reported his accounting professor to the school’s bias team after she made a “passive-aggressive” comment concerning student course evaluations. The College […]| Minding The Campus
We are excited to announce the release of an innovative research report on social capital in Guernsey, offering insights into the strengths and weaknesses of the island’s social networks and culture, as well as the evolving challenges it faces. Using a cutting-edge methodology with similarities to a SWOT analysis in strategic planning, this study combines […] The post New Research Report on Social Capital in Guernsey Unveiled appeared first on Institute for Social Capital.| Institute for Social Capital
Two-year findings and recommendations from our evaluation of a deferred prosecution program in Winnebago County, Illinois.| Loyola University of Chicago Center for Criminal Justice
New research has shed light on the significant public health, equity and financial benefits of New York City's Vision Zero initiative! The study shows how Vision Zero road safety work not only reduced deaths and injuries, but also reduced healthcare related expenditures, particularly benefiting NYC’s low-income and Black residents.| Vision Zero Network
Corporate influence management is a strategic approach used by organizations to shape perceptions, drive public opinion, and foster positive relationships with stakeholders. It involves the deliberate management of a company’s interactions with key influencers, including media, industry leaders, and public figures, to enhance its reputation and achieve its business objectives. Effective corporate influence management helps […]| DotCom Magazine-Influencers And Entrepreneurs Making News
The International Nursing Association for Clinical Simulation and Learning (INACSL) has developed the “INACSL Standards of Best Practice: Simulation” to advance the science of simulation, share the...| HealthySimulation.com
A triteral partnership between Primeval Energy, IGM, and DGeG has been signed for a detailed evaluation of the geothermal potential in Malaysia.| Think GeoEnergy - Geothermal Energy News
Process evaluation of an Effective Practices in Community Supervision (EPICS) program in Cook County, Illinois.| Loyola University of Chicago Center for Criminal Justice
Technical Report to Reducing Revocations Challenge: The Cook County (Chicago) Adult Probation Department and Loyola University Chicago Action Research Team Final Report| Loyola University of Chicago Center for Criminal Justice
Process evaluation of a Focused Deterrence Intervention pilot program.| Loyola University of Chicago Center for Criminal Justice
Healthcare simulation tools are crucial to improve clinical education, training, patient care, research, and operational outcomes. The use of metrics, measurement, and tools is an essential component of a comprehensive Simulation Program Evaluation. Think of metrics as the yardstick by which one measures simulation program success. Metrics can also refer to the standards of measurement| HealthySimulation.com
AN OUTLINE FOR THE ANALYSIS OF SPECIAL FEATURE ARTICLES I. SOURCES OF MATERIAL 1. What appears to have suggested the subject to the writer? 2. How much of the article was based on his... The post Rubric for Feature Articles appeared first on Excellence in Literature by Janice Campbell.| Excellence in Literature by Janice Campbell
In July 2024, the Global Fund for Community Foundations (GFCF) extended their global reach to include Australia in the #ShiftThePower movement. I was fortunate enough to be invited to Bali to attend a 3-day meeting with 19 organisations from civil society around the world. Those in the meeting inclu| Another Way Is Possible
I was on the front page of Hacker News for my two last blog posts and I learned various things forom the discussion and scrutiny of my approach to evaluating my finetuned LLMs.| mlops.systems
I summarise the kinds of evaluations that are needed for a structured data generation task.| mlops.systems
I evaluated the baseline performance of OpenAI's GPT-4-Turbo on the ISAF Press Release dataset.| mlops.systems
How the debriefing process - essential to success in the world of flight - can help leaders achieve excellence.| TrainingZone
Holistic Candidate Evaluation for recruiting and sourcing with skills| RecruitingDaily
How did nonprofits change because of COVID-19?: Over the past three years, the nonprofit sector has undergone a profound change. A new study, spanning 2020 to 2023, examines the changes nonprofits made in response to COVID-19 and looks at the characteristics of the most adaptive nonprofits.| Theory of Change Community
Supporting Gender Equality in the Public Sector At the Centre for Family Research and Evaluation (CFRE), we are excited to continue to contribute to state-wide progress towards gender equality with the Commission for Gender Equality in the Public Sector. We have recently been appointed a member of the expert Panel of Providers, within the evaluation stream, and are looking forward to supporting defined entities comply with the Gender Equality Act. […] The post Supporting Gender Equality in ...| Centre for Family Research and Evaluation
A logic model was developed to track progress and ensure everyone understands the project's goals and how their work contributes to its success.| National Disability Center for Student Success
Some thoughts on how we could capitalise on Mozilla's Open Badge Framework as a way to enable trustworthy reporting of web site accessibility.| The '58 sound
We reported interim findings from an ongoing evaluation of a deferred prosecution program in Winnebago County, Illinois.| Loyola University of Chicago Center for Criminal Justice
Birmingham Festival 23 today shares the full evaluation results that demonstrate the positive impact the 10-day free event had on the city. The Festival| Birmingham Festival 23
In the previous post I raised various issues with the ODNI Rating Scale for intelligence products. At least some of these are serious problems. Some time in the future ODNI might put out a new and …| Tim van Gelder
This post considers a number of issues with analytic product evaluation method described in the document ODNI Rating Scale for Evaluating Analytic Tradecraft Standards (Rating Scale), the mo…| Tim van Gelder
Learn how to evaluate LLMs and RAG pipelines using Langchain and Hugging Face| www.philschmid.de
The Center collaborated with the Illinois Department of Corrections’ Planning and Research Unit to examine the number, characteristics, and circumstances of individuals on Mandatory Supervised Release (MSR, or “parole”) who violated conditions of their release and were returned to prison (i.e., “technical violators”).| Loyola University of Chicago Center for Criminal Justice
Some Insights from Illinois Practitioner Interviews| Loyola University of Chicago Center for Criminal Justice
This report by researchers at the Center for Criminal Justice Research, Policy, and Practice at Loyola University Chicago analyzes the impact of bond reform in Cook County on felony bond court decisions, pretrial release, and crime.| Loyola University of Chicago Center for Criminal Justice