Gigasoft Tool Targets AI Code Hallucinations

Charting software company Gigasoft has released ProEssentials v10, which includes a tool designed to solve hallucinated property names in AI-generated code. The tool, `pe_query.py`, validates code against the compiled DLL binary to eliminate a common source of errors when using AI assistants for development.

- AI-generated code contains roughly 1.7 times more issues than code written by humans, with many errors only appearing during runtime when the code interacts with real data. This has led to the rise of "package hallucinations," where AI confidently generates code referencing non-existent software libraries, creating a security risk that malicious actors can exploit by creating harmful packages with those same hallucinated names. - To combat these errors, developers use a combination of static analysis and runtime validation. Static analysis tools scan source code for structural defects and known vulnerabilities before it runs, while runtime validation tests the code's behavior against actual production traffic to catch failures that static tools miss. - The primary method for aligning AI models and reducing harmful or incorrect outputs has been Reinforcement Learning from Human Feedback (RLHF), which uses human reviewers to rank model responses. However, the cost and inconsistency of human supervision at scale are significant bottlenecks, leading to the development of Constitutional AI. - Constitutional AI, a method used by Anthropic in their Claude models, relies on Reinforcement Learning from AI Feedback (RL-AIF), where the model critiques and refines its own outputs based on a written set of principles, or a "constitution". This approach reduces the dependency on human raters for every single data point, using them more for oversight. - For complex domains like code generation, the industry is shifting from large-scale, crowd-sourced data annotation to smaller, higher-quality datasets curated by domain experts. The quality of human feedback directly impacts the quality of the AI model, making expert validation crucial for building reliable systems. - Evaluating the performance of more autonomous, "agentic" AI systems requires new benchmarks beyond simple text generation. Frameworks like AgentBench, WebArena, and GAIA test agents on multi-step reasoning, web navigation, and tool usage to measure their real-world task success. - The fundraising landscape for AI startups is highly competitive, with investors concentrating capital in later-stage rounds for companies with clear traction. In the first half of 2025, AI startups in the U.S. raised $104.3 billion, but most exits have been smaller acquisitions rather than large IPOs, indicating a disconnect between investment and market liquidity. - The adoption of AI is transforming the labor market, with estimates suggesting it could expose nearly 40% of global jobs to some degree of change. While this creates anxiety about job displacement, it also drives demand for new skills in data analysis, machine learning, and creative thinking, leading to wage premiums for workers who upskill.

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.