{
  "version": "https://jsonfeed.org/version/1",
  "title": "Artificial Intelligence on Edward Kiledjian",
  "icon": "https://avatars.micro.blog/avatars/2025/35/1555731.jpg",
  "home_page_url": "https://kiledjian.com/",
  "feed_url": "https://kiledjian.com/feed.json",
  "items": [
      {
        "id": "http://ekiledjian2.micro.blog/2026/06/19/the-mitlicensed-frontier-why-glm.html",
        "title": "The MIT-Licensed Frontier: Why GLM-5.2 Reshapes Enterprise AI Trade-Offs",
        "content_html": "<p>Enterprise artificial intelligence strategy is shifting from model selection to control architecture selection.</p>\n<p>As organizations move from experimental deployments toward production-grade agentic systems, the dominant constraints are no longer model performance alone. They are increasingly defined by control over weights, data residency, licensing structure, and operational governance boundaries.</p>\n<p>The release of GLM-5.2 by Z.ai (Zhipu AI) reflects this shift. Based on publicly available technical documentation and reported benchmark evaluations, the model is positioned as a large-scale mixture-of-experts system targeting frontier-level capability in software engineering and multi-step reasoning tasks.</p>\n<p>Its significance is not isolated performance. It is the combination of capability, deployment flexibility, and permissive licensing under a widely used open-source framework.</p>\n<hr>\n<h1 id=\"from-prompt-driven-use-to-agentic-engineering\">From Prompt-Driven Use to Agentic Engineering</h1>\n<p>Enterprise adoption of large language models has historically been dominated by prompt-centric workflows. In this model, systems are used as stateless interfaces that generate discrete outputs without persistent operational context.</p>\n<p>While effective for productivity augmentation, this approach does not scale well to complex engineering environments involving long-running workflows, system-level orchestration, or multi-repository codebases.</p>\n<p>A structural shift is now underway toward agentic engineering, where models operate as components within coordinated systems rather than standalone tools.</p>\n<p>For example, agentic systems may be used to coordinate multi-repository refactoring or automate security patch triage across distributed codebases.</p>\n<p>Within this framing, GLM-5.2 is positioned as part of a class of systems designed for long-horizon execution in software engineering environments involving iterative debugging, tool-assisted workflows, and structured reasoning over large codebases.</p>\n<p>Public technical descriptions suggest three broad capability directions:</p>\n<ul>\n<li>Extended context handling for large-scale code and data environments</li>\n<li>Asynchronous reinforcement learning approaches intended to improve iterative system behaviour</li>\n<li>Safety and integrity mechanisms designed to reduce reward manipulation in automated evaluation environments</li>\n</ul>\n<p>While implementation details vary across available documentation, the strategic direction is consistent: improved reliability in multi-step, tool-mediated execution environments.</p>\n<hr>\n<h1 id=\"reported-benchmark-positioning-contextual-not-absolute\">Reported Benchmark Positioning (Contextual, Not Absolute)</h1>\n<p>Public benchmark summaries suggest GLM-5.2 is positioned within the upper tier of recent frontier models on selected software engineering and reasoning tasks.</p>\n<p>These evaluations are typically conducted on structured benchmarks involving multi-step reasoning and code generation tasks, often compared against proprietary systems from leading AI providers.</p>\n<p>It is important to note that cross-model comparisons are sensitive to:</p>\n<ul>\n<li>evaluation methodology</li>\n<li>inference configuration</li>\n<li>tool availability</li>\n<li>compute budget assumptions</li>\n</ul>\n<p>As a result, performance comparisons should be interpreted as conditional rather than absolute.</p>\n<p>The broader signal is more important than any single metric: the performance gap between open-weight systems and proprietary API-based models continues to narrow in specific agentic and coding-focused workloads.</p>\n<hr>\n<h1 id=\"the-licensing-shift-why-mit-matters-in-practice\">The Licensing Shift: Why MIT Matters in Practice</h1>\n<p>A defining characteristic of GLM-5.2 is its release under the MIT License, one of the most permissive open-source licences in widespread use.</p>\n<p>From an enterprise perspective, this has structural implications. However, it is critical to distinguish licensing freedom from regulatory compliance or operational readiness.</p>\n<p>MIT licensing primarily governs reuse and redistribution rights. It does not provide exemption from privacy law, sector-specific regulation, or internal governance requirements.</p>\n<p>Within that boundary, three practical implications emerge.</p>\n<hr>\n<h2 id=\"1-increased-deployment-control-and-data-perimeter-flexibility\">1. Increased Deployment Control and Data Perimeter Flexibility</h2>\n<p>Permissive licensing enables deployment within controlled infrastructure environments, including private cloud and isolated compute clusters.</p>\n<p>For regulated organizations, this can reduce dependency on external APIs and improve control over sensitive data flows.</p>\n<p>However, operational reality introduces additional complexity. Secure deployment requires governance across:</p>\n<ul>\n<li>model supply chain integrity</li>\n<li>dependency stacks and runtime environments</li>\n<li>access control and logging frameworks</li>\n</ul>\n<p>Self-hosting increases control, but also increases operational responsibility.</p>\n<hr>\n<h2 id=\"2-reduced-dependency-on-external-ai-platforms\">2. Reduced Dependency on External AI Platforms</h2>\n<p>Proprietary AI APIs introduce structural dependencies on vendor pricing, availability, policy changes, and jurisdictional constraints.</p>\n<p>Self-hosted models reduce exposure to these risks by shifting inference and lifecycle control into enterprise-managed infrastructure.</p>\n<p>This represents a redistribution of risk rather than its elimination.</p>\n<p>In practice, it reduces vendor dependency but increases internal engineering and operational burden.</p>\n<hr>\n<h2 id=\"3-flexibility-in-model-adaptation-and-distillation\">3. Flexibility in Model Adaptation and Distillation</h2>\n<p>Permissive licensing enables fine-tuning and distillation into smaller models optimized for domain-specific use cases.</p>\n<p>This supports a layered enterprise architecture where:</p>\n<ul>\n<li>large models handle complex reasoning and planning tasks</li>\n<li>smaller models support high-volume, latency-sensitive operations</li>\n</ul>\n<p>Self-hosting becomes economically rational when sustained inference utilization justifies dedicated GPU allocation across multiple concurrent workloads.</p>\n<hr>\n<h1 id=\"operational-and-security-considerations\">Operational and Security Considerations</h1>\n<p>Despite its advantages, GLM-5.2 is not a universal replacement for proprietary frontier systems.</p>\n<p>In certain long-horizon or highly complex tasks, proprietary models continue to demonstrate stronger consistency, ecosystem maturity, and tool integration support.</p>\n<p>Additionally, enterprise deployment introduces non-trivial security considerations:</p>\n<ul>\n<li>Model provenance risk, including integrity of downloaded weights</li>\n<li>Inference-layer attack surfaces such as prompt injection in tool-using agents</li>\n<li>Supply chain dependencies across GPU drivers and inference frameworks</li>\n<li>Operational isolation challenges in environments marketed as “air-gapped”</li>\n</ul>\n<p>In agentic deployments, the dominant risk shifts from model misuse to control-plane compromise.</p>\n<p>These factors require formal threat modeling prior to production deployment.</p>\n<hr>\n<h1 id=\"economic-and-infrastructure-trade-offs\">Economic and Infrastructure Trade-Offs</h1>\n<p>Self-hosting frontier-scale models introduces a fundamentally different cost structure compared to API-based consumption.</p>\n<p>Rather than variable usage-based pricing, organizations assume:</p>\n<ul>\n<li>capital expenditure for compute infrastructure</li>\n<li>ongoing operational costs for maintenance and scaling</li>\n<li>specialized engineering effort for deployment optimization</li>\n</ul>\n<p>As a result, hybrid architectures combining external APIs with internal models are likely to remain the dominant enterprise pattern.</p>\n<hr>\n<h1 id=\"strategic-implications-for-enterprise-architecture\">Strategic Implications for Enterprise Architecture</h1>\n<p>For technology and security leaders, the emergence of systems such as GLM-5.2 reinforces several structural shifts:</p>\n<ul>\n<li>control is now a first-order architectural constraint</li>\n<li>licensing terms directly influence deployment feasibility</li>\n<li>hybrid architectures are becoming the default enterprise pattern</li>\n<li>governance maturity increasingly determines AI adoption scope</li>\n</ul>\n<p>These dynamics reflect a broader rebalancing of enterprise AI strategy toward controllability, risk segmentation, and architectural flexibility.</p>\n<hr>\n<h1 id=\"closing-perspective\">Closing Perspective</h1>\n<p>The enterprise AI landscape is entering a phase where performance differentials between leading models are narrowing in specific domains, particularly software engineering and agentic workflows.</p>\n<p>As this convergence continues, structural factors—licensing, deployment control, governance maturity, and operational risk—are becoming primary differentiators in enterprise decision-making.</p>\n<p>GLM-5.2 should therefore be understood not as a singular technological breakthrough, but as an indicator of where the market is moving: toward distributed, hybrid, and controllable AI systems where sovereignty and capability must be balanced against operational complexity and risk.</p>\n<hr>\n<h1 id=\"ethics-statement\">Ethics Statement</h1>\n<p>This article is written in accordance with principles of transparency, analytical independence, and responsible interpretation of emerging artificial intelligence systems. It is intended to provide strategic insight rather than promotional or vendor-aligned positioning.</p>\n<p>Readers should interpret all model capabilities, benchmarks, and architectural claims as subject to change, variation in deployment context, and differences in evaluation methodology. Independent validation is recommended prior to any production use or procurement decision.</p>\n<hr>\n<h1 id=\"disclaimer\">Disclaimer</h1>\n<p>The information provided in this article is for informational and analytical purposes only. It does not constitute legal, security, or procurement advice.</p>\n<p>Model performance characteristics, licensing interpretations, and benchmark results may vary depending on implementation, infrastructure configuration, quantization approach, and upstream changes.</p>\n<p>Readers are responsible for conducting their own due diligence, including security validation, compliance assessment, and operational testing, prior to deploying any model in production environments.</p>\n<hr>\n<p>Keywords: #AI #ArtificialIntelligence #EnterpriseAI #GenerativeAI #AgenticAI #OpenSourceAI #AIGovernance #AISecurity #ResponsibleAI #Cybersecurity #CISO #CTO #EnterpriseArchitecture #TechnologyStrategy #DigitalTransformation #DataSovereignty #ModelGovernance #RiskManagement #InformationSecurity #CloudSecurity #AIInfrastructure #MachineLearning #LLM #OpenWeights #MITLicense #HybridAI #TechnologyLeadership #Innovation #DataPrivacy #EnterpriseSecurity</p>\n<img src=\"https://cdn.uploads.micro.blog/255457/2026/aipic.png\">",
        "date_published": "2026-06-19T09:09:17-04:00",
        "url": "https://kiledjian.com/2026/06/19/the-mitlicensed-frontier-why-glm.html",
        "tags": ["Artificial Intelligence","Technology \u0026 Business"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2026/04/23/the-first-hurdle-is-the.html",
        
        "content_html": "<p><a href=\"https://www.itpro.com/business/business-strategy/the-first-hurdle-is-the-hardest-in-generative-ai-adoption-and-businesses-keep-falling\" target=\"_blank\" rel=\"noopener noreferrer\">The first hurdle is the hardest in generative AI adoption – and businesses keep falling | IT Pro</a></p>\n<p>Despite rapid AI adoption, many businesses struggle with implementation, falling into &ldquo;pilot purgatory&rdquo; due to issues like skills gaps, legacy systems, and a lack of advanced use cases. While employees report individual productivity gains, companies are slow to achieve business-wide benefits, with a significant portion of firms still in basic AI application stages.</p>\n",
        "date_published": "2026-04-23T15:24:08-04:00",
        "url": "https://kiledjian.com/2026/04/23/the-first-hurdle-is-the.html",
        "tags": ["Artificial Intelligence","Technology \u0026 Business"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2026/03/24/personas-in-ai-friend-or.html",
        "title": "Personas in AI, friend or foe?",
        "content_html": "<p>Are you using persona prompts with AI? Here&rsquo;s what the research actually says.</p>\n<p><a href=\"https://arxiv.org/html/2603.18507v1\" target=\"_blank\" rel=\"noopener noreferrer\">arxiv.org/html/2603&hellip;</a></p>\n<p>A new study from USC (&ldquo;Expert Personas Improve LLM Alignment but Damage Accuracy&rdquo;) tested expert persona prompts across six large language models and finally explains why the community has seen such mixed results.</p>\n<p>The finding is simple but important: persona prompts are an alignment tool, not a knowledge tool.</p>\n<p>When personas HELP:\n→ Writing tone and style (scores jumped from 7/10 to 9/10 on professional email drafting)\n→ Safety and refusal (jailbreak resistance improved by up to 17.7%)\n→ Format adherence, structured output, and intent following\n→ Longer, more detailed persona descriptions amplify these gains</p>\n<p>When personas HURT:\n→ Factual accuracy and knowledge retrieval (accuracy dropped from 71.6% to 68.0%)\n→ Math and logical reasoning (one example went from 9/10 to 1.5/10)\n→ Coding tasks requiring precise recall\n→ Longer personas make the damage worse</p>\n<p>Five things you can do right now:</p>\n<ol>\n<li>\n<p>Use personas for creative, editorial, and compliance-sensitive tasks. Drop them for factual lookups, calculations, and code logic.</p>\n</li>\n<li>\n<p>Place personas in the system prompt, not the user message — it matters on well-optimized models.</p>\n</li>\n<li>\n<p>If you&rsquo;re using reasoning models (like DeepSeek R1), skip expert personas entirely. The research shows a random persona works just as well — the model only benefits from added context length, not expertise.</p>\n</li>\n<li>\n<p>For safety hardening, a dedicated &ldquo;safety monitor&rdquo; persona in the system prompt is one of the cheapest and most effective interventions available.</p>\n</li>\n<li>\n<p>When you must use a persona on accuracy-sensitive work, keep it as short as possible to minimize interference with factual recall.</p>\n</li>\n</ol>\n<p>The bottom line: treat persona prompts like a tone and alignment amplifier, not a knowledge enhancer. Knowing when to use them — and when to strip them out — is a real competitive advantage.</p>\n<p>Paper: &ldquo;Expert Personas Improve LLM Alignment but Damage Accuracy: Bootstrapping Intent-Based Persona Routing with PRISM&rdquo; (Hu, Rostami, Thomason — USC, 2026)</p>\n",
        "date_published": "2026-03-24T07:29:00-04:00",
        "url": "https://kiledjian.com/2026/03/24/personas-in-ai-friend-or.html",
        "tags": ["Artificial Intelligence"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2026/03/10/codewall-says-it-hacked-mckinseys.html",
        "title": "CodeWall says it hacked McKinsey’s AI platform. Here’s what holds up — and what doesn’t.  ",
        "content_html": "<p>This reflects my personal assessment of publicly available reporting and CodeWall’s published blog post. I was not involved in the testing, I do not have access to McKinsey’s internal facts or forensic findings, and my views should be read as commentary and opinion rather than statements of verified fact.</p>\n<p>A security startup called CodeWall claims its autonomous agent compromised McKinsey’s internal AI platform, Lilli, within two hours and gained unauthenticated read-write access to a production database containing tens of millions of consultant conversations. The vulnerability appears credible. The claimed scope of impact is not fully evidenced. The primary CodeWall post is here: <a href=\"https://codewall.ai/blog/how-we-hacked-mckinseys-ai-platform.\" target=\"_blank\" rel=\"noopener noreferrer\">codewall.ai/blog/how-&hellip;</a> Independent reporting by Jessica Lyons in The Register is here: <a href=\"https://www.theregister.com/2026/03/09/mckinsey_ai_chatbot_hacked/.\" target=\"_blank\" rel=\"noopener noreferrer\">www.theregister.com/2026/03/0&hellip;</a></p>\n<h2 id=\"what-is-likely-true\">What is likely true</h2>\n<p>The attack chain CodeWall describes — publicly exposed API documentation, unauthenticated endpoints, SQL injection through unsafely handled JSON keys and IDOR chaining — is plausible and technically sound. JSON key injection is an uncommon vector. Most security testing tools and methodologies focus on input values, not field names. If Lilli’s backend parameterized values while concatenating keys directly into SQL, that would create a blind spot many assessments could miss.</p>\n<p>McKinsey’s response supports the credibility of the finding. In The Register, journalist Jessica Lyons reported that McKinsey acknowledged the issues, patched them within hours and said its forensic review found no evidence that client data or confidential information were accessed by the researcher or any unauthorized party. That report also quotes CodeWall CEO Paul Price on the company’s use of an autonomous agent.</p>\n<p>The prompt-layer risk CodeWall highlights is also substantive. If Lilli’s system prompts — the instructions governing how the AI behaves — were stored in the same database to which the agent had write access, an attacker could alter AI behaviour at scale without a traditional code deployment and potentially outside standard release controls. Many organizations have not explicitly modelled this threat, and prompt-layer integrity controls remain immature in many environments.</p>\n<h2 id=\"what-is-overstated-or-unproven\">What is overstated or unproven</h2>\n<p>CodeWall claims 46.5 million chat messages, 728,000 files, 57,000 user accounts and hundreds of thousands of AI configurations were accessible. The blog provides no proof-of-concept payloads, no hashes, no screenshots and no evidence showing privilege boundaries. It is unclear whether those figures represent records the agent actually retrieved, database row counts inferred from metadata or something in between.</p>\n<p>More importantly, the blog conflates three categories that any security professional should keep separate: what was theoretically reachable, what was actually accessed and what was verified as exfiltrated. CodeWall emphasizes reachability. McKinsey’s statement addresses investigated access. Both could be true at the same time, but the blog does not clearly distinguish between them.</p>\n<p>The two-hour timeline also deserves scrutiny. Blind SQL injection is typically slow because extraction happens incrementally. The post suggests verbose error messages may have accelerated discovery, which implies the path may have combined error-assisted identification with later blind or semi-blind extraction. That is plausible, but the article does not provide enough technical detail to substantiate a claim of full production read-write access within two hours and 15 iterations.</p>\n<p>The assertion that a modified prompt “leaves no log trail” is also too absolute. Whether prompt tampering is detectable depends on the target’s database audit logging, configuration versioning and anomaly detection. Mature organizations may log or detect these events. The blog presents the point too categorically.</p>\n<h2 id=\"what-is-concerning-about-the-disclosure-itself\">What is concerning about the disclosure itself</h2>\n<p><strong>Autonomous target selection</strong></p>\n<p>CodeWall presents the fact that its agent independently chose McKinsey as a target as a feature. An AI system deciding whom to attack — even if limited to organizations with disclosure policies — raises serious questions about operator control, authorization and liability. That issue deserves careful scrutiny, not celebration.</p>\n<p><strong>Unresolved scope authorization</strong></p>\n<p>The blog cites McKinsey’s HackerOne responsible disclosure policy as justification, but neither the blog nor independent reporting confirms whether Lilli’s production infrastructure was explicitly in scope for that programme. A disclosure policy is not blanket authorization to enumerate a production database. McKinsey’s public policy is referenced by CodeWall here: <a href=\"https://hackerone.com/mckinsey-and-company.\" target=\"_blank\" rel=\"noopener noreferrer\">hackerone.com/mckinsey-&hellip;</a></p>\n<p><strong>Rushed disclosure</strong></p>\n<p>The issue was discovered Feb. 28, 2026. The public blog was published March 9. McKinsey may have patched quickly, but rapid remediation is not the same as a completed forensic review, variant analysis and confirmation that the vulnerability had not previously been exploited by others. Nine days is a compressed window for all of that.</p>\n<p>The published timeline also appears to contain a date inconsistency issue discussed in commentary around the post. If there was a typo in an earlier version, it is minor. Even so, in a report making very large claims, editorial sloppiness weakens confidence.</p>\n<h2 id=\"what-security-leaders-should-take-away\">What security leaders should take away</h2>\n<p>This is a conventional application security failure on a platform that happens to run AI workloads. The described attack path — exposed documentation, missing authentication, SQL injection, verbose errors and IDOR — is textbook web and API security. Framing it as an “AI platform hack” is effective marketing. Technically, it is a severe application security failure with AI-specific consequences.</p>\n<p>Two lessons are worth acting on regardless of the blog’s evidentiary gaps.</p>\n<p>First, treat your AI prompt and configuration layer as a crown-jewel asset. If system prompts reside in the same data store as operational data, and that store is reachable through any injection or access-control flaw, you have created a single point of compromise that can silently alter AI behaviour at scale. Apply integrity controls, versioning and monitoring accordingly.</p>\n<p>Second, audit for JSON key injection. If any application accepts JSON in which field names are dynamic, and those names are later used in query construction — whether SQL, NoSQL or ORM-generated queries — standard scanning tools may miss it. That requires targeted review.</p>\n<p><strong>The bottom line:</strong> CodeWall likely found a serious vulnerability. Its blog overstates what was proven, blurs critical distinctions between access and exfiltration, and leaves unresolved questions about authorization and disclosure discipline. The strategic lesson is real, but it is about secure architecture, access control and prompt integrity — not a new class of AI exploit.</p>\n<p><strong>Sources and named parties referenced:</strong> CodeWall; McKinsey &amp; Company; Paul Price, CEO of CodeWall; Jessica Lyons, The Register.</p>\n<h2 id=\"ethics-statement\">Ethics statement</h2>\n<p>This article is intended to support informed discussion about a publicly reported security incident involving CodeWall’s claims about McKinsey’s AI platform, Lilli. It aims to distinguish clearly between CodeWall’s published assertions, McKinsey’s public response, independent media reporting and the author’s professional interpretation. Where facts remain unverified, disputed or incomplete, that uncertainty is stated rather than assumed away. This article does not endorse unauthorized testing, autonomous target selection or activity that exceeds clearly defined responsible disclosure boundaries.</p>\n<h2 id=\"disclaimer\">Disclaimer</h2>\n<p>This article is provided for general information, commentary and discussion purposes only. It is not legal, security, privacy, compliance or other professional advice, and it should not be relied upon as such. The analysis is based on publicly available information at the time of writing, including CodeWall’s blog post, McKinsey’s public statements and independent reporting. The author was not involved in the testing, does not have access to McKinsey’s internal systems, logs or forensic findings, and cannot independently verify all technical or factual claims made by the parties involved. Any errors or omissions are unintentional. The views expressed are those of the author in a personal capacity and do not represent the views of any employer, client, partner or affiliated organization. Generative AI tools were used to assist with research and editing.</p>\n<p>Keywords : #CyberSecurity #AppSec #AI #AIAgents #AISecurity #LLMSecurity #PromptSecurity #PromptInjection #ResponsibleDisclosure #VulnerabilityDisclosure #BugBounty #HackerOne #SQLInjection #IDOR #APISecurity #WebSecurity #SecurityResearch #ThreatModeling #SecureByDesign #SecurityLeadership #RiskManagement #DigitalTrust #InfoSec #SecurityGovernance #DataSecurity #CloudSecurity #RedTeam #BlueTeam #CyberRisk #McKinsey</p>\n<img src=\"https://cdn.uploads.micro.blog/255457/2026/4ac71a6675.png\">",
        "date_published": "2026-03-10T07:51:40-04:00",
        "url": "https://kiledjian.com/2026/03/10/codewall-says-it-hacked-mckinseys.html",
        "tags": ["Artificial Intelligence","Cybersecurity \u0026 Privacy"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2026/03/06/deerflow-bytedances-opensource-ai-agent.html",
        "title": "DeerFlow 2.0: ByteDance’s open-source AI agent harness for research and software tasks",
        "content_html": "<p>DeerFlow 2.0, an open-source project from ByteDance, has quickly become one of the most visible AI agent releases of early 2026. The project’s public repository says it reached No. 1 on GitHub Trending on Feb. 28, 2026, and the repository currently shows about 25,000 stars and 3,000 forks. For teams evaluating agentic systems, DeerFlow deserves attention, but it also warrants disciplined review.</p>\n<p>I have been testing DeerFlow 2.0 over the past week. The short version is this: it is more capable and more complete than many open-source agent projects, but some of the public enthusiasm around it is running ahead of careful governance, privacy and security assessment. For a business, IT, security and privacy audience, that distinction matters.</p>\n<h3 id=\"what-deerflow-20-is\">What DeerFlow 2.0 is</h3>\n<p>DeerFlow, short for Deep Exploration and Efficient Research Flow, began as a deep-research framework. The project’s maintainers then rebuilt it as a broader agent runtime. According to the official materials, DeerFlow 2.0 is a ground-up rewrite built on LangGraph and LangChain, with built-in support for memory, filesystem access, skills, sandboxed execution and sub-agents.</p>\n<p>In practical terms, DeerFlow is not just another chat interface with tools attached. It is better understood as an agent harness: a runtime that can plan work, break it into subtasks, invoke tools, generate and execute code, manage files and return finished outputs. That architecture is what makes it more relevant to serious experimentation than many lighter open-source alternatives.</p>\n<p>Two official references are worth reviewing first:</p>\n<ul>\n<li>Official site: <a href=\"https://deerflow.tech\" target=\"_blank\" rel=\"noopener noreferrer\">deerflow.tech</a></li>\n<li>GitHub repository: <a href=\"https://github.com/bytedance/deer-flow\" target=\"_blank\" rel=\"noopener noreferrer\">github.com/bytedance/deer-flow</a></li>\n</ul>\n<h3 id=\"how-it-works\">How it works</h3>\n<p>You give DeerFlow a goal in plain language. The lead agent then plans the work, divides it into subtasks, invokes supporting tools and, where needed, spawns sub-agents to handle specialized roles. Based on the project documentation and visible demos, DeerFlow 2.0 can:</p>\n<ol>\n<li>Plan and decompose multi-step tasks</li>\n<li>Spawn sub-agents with separate context and responsibilities</li>\n<li>Use search, browsing and file-based workflows</li>\n<li>Write and execute code in a sandboxed environment</li>\n<li>Manage files and directories across a persistent workspace</li>\n<li>Return finished artefacts such as reports, code, dashboards and other outputs</li>\n</ol>\n<p>That is a meaningful step up from agent frameworks that require much more assembly work before they become operational. DeerFlow’s key strength is not that any one individual feature is unique. It is that the project packages several of those features into a more usable starting point.</p>\n<h3 id=\"what-stands-out-technically\">What stands out technically</h3>\n<p>A few characteristics make DeerFlow 2.0 more consequential than the average open-source agent release.</p>\n<p>First, it is delivered as a more complete runtime rather than as a toolkit that expects the user to build the rest. That lowers the barrier to experimentation.</p>\n<p>Second, it supports longer-horizon work. The project’s positioning and demos emphasize tasks that may take minutes or longer, rather than quick prompt-response exchanges.</p>\n<p>Third, it has a stronger execution model than many early agent projects. Filesystem access, skills, memory and sandboxed code execution create a more realistic operating environment for agents.</p>\n<p>Fourth, it appears model-agnostic. The public materials indicate support for multiple OpenAI-compatible endpoints and local model options, which gives teams more flexibility in how they approach privacy, cost and deployment.</p>\n<p>That said, none of those points should be confused with a production-readiness certification. Capability and readiness are not the same thing.</p>\n<h3 id=\"what-it-can-do-now\">What it can do now</h3>\n<p>Based on official materials and public demonstrations, DeerFlow 2.0 is positioned for tasks such as:</p>\n<ul>\n<li>building websites and interactive dashboards from short briefs</li>\n<li>conducting exploratory analysis on datasets</li>\n<li>generating research outputs with citations</li>\n<li>producing documents, slides and content artefacts</li>\n<li>coordinating multi-step software or research workflows</li>\n</ul>\n<p>In testing, the more compelling takeaway is not that DeerFlow can produce flashy outputs. Many tools can do that in curated demos. The more important point is that DeerFlow is trying to operationalize the entire chain from planning to execution to artefact delivery inside one environment. That is why it has generated so much attention.</p>\n<h3 id=\"where-the-current-hype-needs-more-discipline\">Where the current hype needs more discipline</h3>\n<p>This is where the discussion needs to become more precise.</p>\n<p>A number of public claims about DeerFlow are either overstated or not yet sufficiently documented. I would be cautious about repeating unverified assertions about default telemetry behaviour, optional cloud-memory backends, authentication changes in the web UI or broad multilingual performance. Some of those claims may prove correct in specific builds, issues or branches, but they should not be treated as settled facts without direct evidence from the exact version being assessed.</p>\n<p>That point is important well beyond this project. In the agent space, people often blend official documentation, demos, open issues, unmerged pull requests and personal testing into one narrative. That produces enthusiasm, but it does not always produce accuracy.</p>\n<h3 id=\"security-and-privacy-considerations\">Security and privacy considerations</h3>\n<p>For security and privacy professionals, DeerFlow should be treated as an agentic execution platform, not merely as an AI assistant. The relevant control questions are therefore broader and more serious.</p>\n<p><strong>What works in its favour</strong></p>\n<ul>\n<li>It is open source and auditable.</li>\n<li>It supports containerized execution models rather than forcing host-level execution.</li>\n<li>It provides a structured runtime with memory, filesystem access and tool orchestration rather than hiding those behaviours behind a black box.</li>\n<li>It appears suitable for self-hosted deployment patterns.</li>\n</ul>\n<p><strong>What requires scrutiny</strong></p>\n<ul>\n<li><strong>Code execution risk:</strong> The platform can generate and run code. That creates obvious exposure if execution is not isolated properly.</li>\n<li><strong>Prompt injection and tool abuse:</strong> Any system that consumes external content and can invoke tools is exposed to adversarial inputs, malicious instructions and unsafe chaining.</li>\n<li><strong>Outbound data flow:</strong> Prompts, files, outputs and intermediate artefacts may be exposed to whichever model endpoints or external services are configured.</li>\n<li><strong>Secrets handling:</strong> Teams need to understand how credentials are stored, injected, rotated and exposed to tools or generated code.</li>\n<li><strong>Persistence risk:</strong> Memory and workspace persistence can improve usability, but they can also preserve sensitive information longer than intended.</li>\n<li><strong>Supply-chain intake:</strong> Open source improves auditability, but it does not eliminate dependency, image-provenance or update-governance risk.</li>\n<li><strong>Jurisdictional scrutiny:</strong> ByteDance’s ownership and country-of-origin context will trigger additional review in some organizations and sectors, regardless of the code’s functional merits.</li>\n</ul>\n<p>For any enterprise assessment, I would also ask a more basic question: what exactly is the threat model? If the answer is not clear, the evaluation is not complete.</p>\n<h3 id=\"governance-baseline-i-would-recommend\">Governance baseline I would recommend</h3>\n<p>For organizations considering DeerFlow or a similar platform, I would start with a baseline such as this:</p>\n<ul>\n<li>deploy it only in containerized form, with hardened images and restricted privileges</li>\n<li>apply strict network egress controls</li>\n<li>use only approved model backends and approved data paths</li>\n<li>prohibit use with regulated, confidential or customer-sensitive data until governance is complete</li>\n<li>review dependency intake, image provenance and update processes</li>\n<li>define memory retention and workspace retention rules before broader use</li>\n<li>validate authentication, logging and access controls in the exact deployment version</li>\n<li>test for prompt injection, unsafe tool invocation and secrets exposure before production use</li>\n</ul>\n<p>This is not unique to DeerFlow. It is the minimum standard I would apply to any agent platform with code execution, external retrieval and file manipulation capabilities.</p>\n<h3 id=\"compliance-and-legal-context\">Compliance and legal context</h3>\n<p>From a privacy and compliance perspective, the main issue is not whether DeerFlow is open source. The main issue is where data goes, which providers or services can receive it, how long it persists and under which legal and contractual controls it is processed.</p>\n<p>Relevant frameworks will vary by jurisdiction, but teams should think in terms of existing obligations under the European Union’s General Data Protection Regulation, California’s CCPA and CPRA, and Canada’s Personal Information Protection and Electronic Documents Act, along with sector-specific and local rules. In Canada, it is particularly important not to write as though Bill C-27 is already coming into force. It is not current law.</p>\n<p>Legal teams should also look beyond privacy. Agentic systems can introduce issues related to software intake, licensing, intellectual property, auditability, export controls, customer commitments and acceptable use.</p>\n<h3 id=\"how-deerflow-compares-with-the-field\">How DeerFlow compares with the field</h3>\n<p>DeerFlow is not the only project trying to make agents practical, but it is one of the more polished open-source efforts in early 2026. Compared with frameworks that require substantial assembly, it offers a more complete starting environment. Compared with narrower coding-agent projects, it appears to have broader ambition around research, orchestration and output generation.</p>\n<p>Its main advantage is packaging. Its main challenge is trust. Not trust in the narrow sense of whether it works, but trust in the broader sense that matters to businesses: where it runs, what it connects to, how it handles data, how it executes code and whether the surrounding controls are strong enough.</p>\n<h3 id=\"final-assessment\">Final assessment</h3>\n<p>DeerFlow 2.0 is one of the more important open-source agent releases of early 2026. It brings together planning, tools, memory, file handling, sandboxed execution and sub-agent orchestration in a way that makes the platform more usable than many experimental alternatives. That is real progress.</p>\n<p>At the same time, teams should resist the temptation to equate visible momentum with operational maturity. DeerFlow is promising, but it should be assessed like any other high-capability agent platform: carefully, version by version, with explicit controls around execution, data flow, memory, access and software intake.</p>\n<p>If you are exploring agentic systems this year, DeerFlow is worth reviewing. Just make sure your evaluation is grounded in documented facts, not just community excitement.</p>\n<h2 id=\"ethics-statement\">Ethics statement</h2>\n<p>This article is intended to support informed discussion about open-source AI agent platforms, with a particular focus on execution, governance, privacy and security implications. It aims to distinguish clearly between verified project documentation, publicly observable repository information, the author’s hands-on testing and the author’s professional interpretation. Where a feature, control or deployment behaviour is uncertain, version-dependent or not fully documented publicly, that uncertainty is stated rather than assumed away. This article does not endorse deploying autonomous code-execution systems in production without appropriate review, nor does it advocate bypassing legal, contractual, security, privacy or governance requirements.</p>\n<h2 id=\"disclaimer\">Disclaimer</h2>\n<p>This article is provided for general information and discussion purposes only. It is not legal, security, privacy, compliance or professional advice, and it should not be relied upon as such. Open-source software projects, model integrations, feature sets, default configurations and security controls can change quickly, including between releases, commits and deployment methods. Any assessment of DeerFlow or similar tools should be validated against the exact version, configuration, model providers, hosting environment and organizational requirements in scope. Jurisdictional obligations related to privacy, data residency, software supply chain, export controls and sector regulation may also vary materially. Any errors or omissions are unintentional. The views expressed are those of the author in a personal capacity and do not represent the views of any employer, client, partner or affiliated organization. Generative AI tools were used to assist with research and editing.</p>\n<h2 id=\"keywords\">Keywords</h2>\n<p>#DeerFlow #DeerFlow2 #ByteDance #AIAgents #AgenticAI #OpenSourceAI #LangGraph #LangChain #AIInfrastructure #SoftwareAgents #CodingAgents #AgentSecurity #AIGovernance #Privacy #DataProtection #Compliance #PIPEDA #GDPR #CCPA #CPRA #EnterpriseAI #AIPlatform #ContainerSecurity #SupplyChainSecurity #PromptInjection #ModelRisk #DataGovernance #Cybersecurity #Infosec #PrivacyEngineering #DevTools #SelfHostedAI #AILabs #SoftwareSecurity #RiskManagement</p>",
        "date_published": "2026-03-06T15:46:00-04:00",
        "url": "https://kiledjian.com/2026/03/06/deerflow-bytedances-opensource-ai-agent.html",
        "tags": ["Artificial Intelligence"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2026/03/05/heretic-and-the-new-reality.html",
        "title": "Heretic and the new reality of modifiable AI safety  ",
        "content_html": "<p>Open-source large language models have made advanced generative AI broadly accessible. What is changing now is not only model capability, but the ease with which model behaviour can be altered after release — including behaviour that vendors and labs describe as “safety alignment.”</p>\n<p>One of the most visible examples is Heretic, an open-source project that automates the removal of refusal behaviour in transformer-based language models. The project is not subtle about its purpose. It describes itself as “fully automatic censorship removal,” and it is gaining traction quickly.</p>\n<p>This post does not provide instructions for disabling safeguards. Instead, it focuses on what is verifiably true about the tool, the research it is built on, and why this matters for security leaders, developers and governance teams.</p>\n<h2 id=\"what-heretic-is\">What Heretic is</h2>\n<p>Heretic is a Python-based tool that modifies a model to reduce or eliminate refusal responses. It does this through a technique known as directional ablation, commonly referred to in the community as “abliteration.” The tool combines that intervention with automated parameter search using Optuna’s Tree-structured Parzen Estimator (TPE) optimiser.</p>\n<p>In practical terms, Heretic aims to find settings that reduce refusals while keeping the modified model close to the original model’s behaviour on benign prompts. The project describes this trade-off explicitly as co-minimizing refusal counts and KL divergence.</p>\n<p>Project home:<br>\n<a href=\"https://github.com/p-e-w/heretic\" target=\"_blank\" rel=\"noopener noreferrer\">github.com/p-e-w/her&hellip;</a></p>\n<p>A key point many summaries miss is licensing. Heretic is licensed under the GNU Affero General Public License (AGPL) v3.0. That is not a permissive licence. It has real implications for anyone who plans to modify and run the software in networked environments.</p>\n<h2 id=\"what-it-is-built-on-the-refusal-direction-research\">What it is built on: the “refusal direction” research</h2>\n<p>Heretic’s core premise follows mechanistic interpretability research published in 2024: “Refusal in Language Models Is Mediated by a Single Direction,” by Arditi et al.</p>\n<p>In that work, researchers found that refusal behaviour in multiple popular chat models can be linked to a one-dimensional subspace in the residual stream. They demonstrate that removing that direction reduces refusals, while adding it can induce refusals even for harmless requests. The broader conclusion is uncomfortable but important: current alignment methods can be brittle, and model behaviour can sometimes be controlled through targeted internal interventions rather than retraining.</p>\n<p>Paper (Arditi et al.):<br>\n<a href=\"https://arxiv.org/abs/2406.11717\" target=\"_blank\" rel=\"noopener noreferrer\">arxiv.org/abs/2406&hellip;.</a></p>\n<h2 id=\"how-heretic-differs-from-earlier-abliteration-workflows\">How Heretic differs from earlier abliteration workflows</h2>\n<p>Abliteration itself is not new. What Heretic productizes is automation and repeatability.</p>\n<p>Earlier approaches often required manual experimentation: selecting layers, choosing projection strengths and validating results with ad hoc tests. Heretic packages that into an optimiser-driven workflow. It searches parameter combinations to reduce refusals and limit behavioural drift, using quantitative measures as guardrails.</p>\n<p>This is one of the reasons it is being discussed widely. Automation lowers the barrier from “researcher with time” to “user with a capable workstation.”</p>\n<h2 id=\"what-the-project-and-evaluations-actually-show\">What the project and evaluations actually show</h2>\n<p>Two claims circulate frequently: that Heretic can drive refusals close to zero, and that it can do so while preserving most baseline capabilities.</p>\n<p>The project’s own documentation includes examples where Heretic-generated models show refusal suppression comparable to other abliterations, with lower KL divergence in that specific comparison. The documentation also stresses that numerical results vary by hardware and software environment and that benchmarks are not a substitute for human evaluation.</p>\n<p>Independent evaluation work in late 2025 compared Heretic to other abliteration tools across a range of instruction-tuned models. The headline finding was not that any tool is perfect, but that trade-offs are real and model-dependent. The same paper also cautions that controlled benchmarks do not necessarily predict long-run behaviour in multi-turn use.</p>\n<p>Comparative analysis paper (Young et al.):<br>\n<a href=\"https://arxiv.org/abs/2512.13655\" target=\"_blank\" rel=\"noopener noreferrer\">arxiv.org/abs/2512&hellip;.</a></p>\n<p>A consistent theme across reports is that structured reasoning tasks are among the most sensitive. In other words, removing refusals can be technically achievable, but retaining all capabilities is not guaranteed. This should be treated as an engineering problem, not an assumption.</p>\n<h2 id=\"community-adoption-and-the-pace-of-iteration\">Community adoption and the pace of iteration</h2>\n<p>Heretic’s repository shows rapid iteration and strong adoption. Discussion threads on r/LocalLLaMA track releases and performance claims, including changes aimed at reducing VRAM requirements and improving model-loading flexibility. There is also active discussion about false positives in refusal detection and the limits of simple refusal scoring.</p>\n<p>Example discussion threads:<br>\n<a href=\"https://www.reddit.com/r/LocalLLaMA/comments/1oymku1/heretic_fully_automatic_censorship_removal_for/\" target=\"_blank\" rel=\"noopener noreferrer\">www.reddit.com/r/LocalLL&hellip;</a><br>\n<a href=\"https://www.reddit.com/r/LocalLLaMA/comments/1r4n3as/heretic_12_released_70_lower_vram_usage_with/\" target=\"_blank\" rel=\"noopener noreferrer\">www.reddit.com/r/LocalLL&hellip;</a></p>\n<p>This matters because the practical capability is not only the tool, but the ecosystem it enables: repeatable creation and distribution of modified models.</p>\n<h2 id=\"why-this-matters-for-enterprise-security-and-governance\">Why this matters for enterprise security and governance</h2>\n<p>From an enterprise perspective, Heretic is less a novelty and more a signal.</p>\n<p>First, it reinforces that “model safety” is not a reliable control boundary. If a model can be modified to remove refusals, then system safety must be enforced through architecture: data controls, identity, rate limiting, monitoring, output filtering and purpose-built guardrails at the application layer.</p>\n<p>Second, it complicates third-party risk assumptions. If an organisation relies on aligned behaviour as a compliance or safety control, it should assume that aligned behaviour can be bypassed when models are run locally or in uncontrolled environments.</p>\n<p>Third, it raises governance and legal questions. If an organisation modifies and serves software under AGPL, that triggers obligations. Separately, deploying modified models without clear controls can raise policy and regulatory concerns, depending on use case, jurisdiction and sector.</p>\n<p>A practical way to think about it is simple: treat model alignment as a property that can change, and treat safety as something you must engineer end-to-end.</p>\n<h2 id=\"bottom-line\">Bottom line</h2>\n<p>Heretic is a credible, fast-moving implementation of a well-known research insight: refusal behaviour can be represented in low-dimensional directions and suppressed through targeted intervention. It is also a reminder that safety alignment, as currently implemented in many open models, is not an immutable feature.</p>\n<p>For security leaders, the right response is not panic and not denial. It is disciplined control design. Assume models can be modified. Build safety at the system level.</p>\n<p>Sources<br>\nHeretic repository: <a href=\"https://github.com/p-e-w/heretic\" target=\"_blank\" rel=\"noopener noreferrer\">github.com/p-e-w/her&hellip;</a><br>\nArditi et al. (2024): <a href=\"https://arxiv.org/abs/2406.11717\" target=\"_blank\" rel=\"noopener noreferrer\">arxiv.org/abs/2406&hellip;.</a><br>\nOptuna TPE sampler documentation: <a href=\"https://optuna.readthedocs.io/en/stable/reference/samplers/generated/optuna.samplers.TPESampler.html\" target=\"_blank\" rel=\"noopener noreferrer\">optuna.readthedocs.io/en/stable&hellip;</a><br>\nYoung et al. (2025): <a href=\"https://arxiv.org/abs/2512.13655\" target=\"_blank\" rel=\"noopener noreferrer\">arxiv.org/abs/2512&hellip;.</a><br>\nCommunity threads:<br>\n<a href=\"https://www.reddit.com/r/LocalLLaMA/comments/1oymku1/heretic_fully_automatic_censorship_removal_for/\" target=\"_blank\" rel=\"noopener noreferrer\">www.reddit.com/r/LocalLL&hellip;</a><br>\n<a href=\"https://www.reddit.com/r/LocalLLaMA/comments/1r4n3as/heretic_12_released_70_lower_vram_usage_with/\" target=\"_blank\" rel=\"noopener noreferrer\">www.reddit.com/r/LocalLL&hellip;</a></p>\n<p>Keywords: #AI #ArtificialIntelligence #LLM #LargeLanguageModels #MachineLearning #GenerativeAI #AIResearch #AIAlignment #AISafety #AIsecurity #CyberSecurity #InfoSec #EnterpriseSecurity #RiskManagement #AIGovernance #AIRegulation #ResponsibleAI #TechPolicy #DigitalRisk #ModelSecurity #AITrends #AIInnovation #AIethics #OpenSourceAI #DeepLearning #TransformerModels #DataSecurity #ThreatLandscape #SecurityLeadership #CISO #FutureOfAI #EmergingTech #TechStrategy #SecurityStrategy #CyberRisk</p>\n<img src=\"https://cdn.uploads.micro.blog/255457/2026/chatgpt-image-mar-5-2026-at-09-20-51-am.png\">",
        "date_published": "2026-03-05T12:21:00-04:00",
        "url": "https://kiledjian.com/2026/03/05/heretic-and-the-new-reality.html",
        "tags": ["Artificial Intelligence"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2026/03/01/are-dorseys-giant-job-cuts.html",
        
        "content_html": "<p><a href=\"https://www.cnbc.com/2026/02/27/are-dorseys-giant-job-cuts-the-start-of-an-ai-jobs-apocalypse-economists-weigh-in.html\" target=\"_blank\" rel=\"noopener noreferrer\">Are Dorsey&rsquo;s giant job cuts the start of an AI jobs apocalypse? Economists weigh in</a></p>\n<p>Block CEO Jack Dorsey&rsquo;s decision to cut nearly half the company&rsquo;s workforce raises questions about AI&rsquo;s impact on jobs, but economists suggest this is a company-specific adjustment rather than a sign of a broader labor market shift. While AI may disrupt some jobs, experts like Claudia Sahm emphasize that it doesn&rsquo;t necessarily lead to mass layoffs, and other economists believe AI will enhance productivity by changing workflows rather than eliminating jobs outright.</p>\n",
        "date_published": "2026-03-01T12:08:00-04:00",
        "url": "https://kiledjian.com/2026/03/01/are-dorseys-giant-job-cuts.html",
        "tags": ["Artificial Intelligence","Technology \u0026 Business"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2026/01/06/openai-is-rolling-out-gpt.html",
        
        "content_html": "<p><a href=\"https://www.bleepingcomputer.com/news/artificial-intelligence/openai-is-rolling-out-gpt-52-codex-max-for-some-users/\" target=\"_blank\" rel=\"noopener noreferrer\">OpenAI is rolling out GPT-5.2 “Codex-Max” for some users</a></p>\n<p>OpenAI is rolling out GPT-5.2-Codex-Max, a new model for its Codex service, to select subscribers. This advanced version is expected to offer enhanced capabilities for long tasks, context management, and improved reliability, particularly with tool use and understanding visual inputs like screenshots.</p>\n",
        "date_published": "2026-01-06T23:27:00-04:00",
        "url": "https://kiledjian.com/2026/01/06/openai-is-rolling-out-gpt.html",
        "tags": ["Artificial Intelligence"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2026/01/06/the-stein-standard-what-the.html",
        "title": "The \"Stein Standard\": What the OpenAI ruling means for privacy and discovery  ",
        "content_html": "<p>On Jan. 5, 2026, U.S. District Judge Sidney Stein affirmed a significant discovery order requiring OpenAI to produce 20 million de-identified ChatGPT conversation logs to plaintiffs in the consolidated copyright litigation involving The New York Times and other publishers.</p>\n<p>As security and privacy professionals, we often warn about &ldquo;Shadow AI&rdquo; and data leakage. This ruling makes those risks concrete. Here is a balanced analysis of what happened and what it means for Canadian organizations.</p>\n<p>What the court ordered</p>\n<ul>\n<li>OpenAI must produce a sample of 20 million de-identified ChatGPT logs.</li>\n<li>The requested period is Dec. 2022 to Nov. 2024.</li>\n<li>OpenAI’s objections on privacy risk and undue burden were rejected for discovery purposes.</li>\n</ul>\n<p>Scope and safeguards</p>\n<ul>\n<li>Scope: 20 million logs (roughly 0.05 per cent of retained data).</li>\n<li>Safeguards: De-identified data produced under a strict &ldquo;Attorneys&rsquo; Eyes Only&rdquo; protective order.</li>\n</ul>\n<p>Important context</p>\n<ul>\n<li>This is a discovery ruling, not a final decision on copyright infringement.</li>\n<li>This is not a public release of data. The logs are restricted to opposing counsel for analysis.</li>\n</ul>\n<p>Why this matters: The VP perspective<br>\nHere are three takeaways for data governance leaders:</p>\n<ol>\n<li>\n<p>The &ldquo;wiretap&rdquo; distinction<br>\nJudge Stein distinguished ChatGPT interactions from private phone calls (protected under wiretap laws). The court noted that users voluntarily disclose information to a third-party AI, effectively narrowing the expectation of privacy compared to traditional communications.</p>\n</li>\n<li>\n<p>De-identification does not equal anonymity<br>\nWhile the court accepted de-identification as a safeguard for discovery, privacy professionals know this is not a silver bullet. Watch closely to see whether safeguards hold up against adversarial re-identification techniques once data is shared.</p>\n</li>\n<li>\n<p>Discovery is reality<br>\nThis establishes a high-water mark for AI litigation. &ldquo;Big Data&rdquo; is no longer a shield against discovery; courts are willing to compel production of massive datasets if they deem it relevant.</p>\n</li>\n</ol>\n<p>The takeaway<br>\nAssume your inputs into public AI models are discoverable, and govern usage accordingly.</p>\n<p>For Canadian organizations, while this is a U.S. ruling, it impacts the global platforms we rely on. It is a timely prompt to review retention practices and reinforce acceptable-use expectations, especially for sensitive or confidential information.</p>\n<p>How is this shifting your approach to AI governance and acceptable use policies?</p>\n<p>#Privacy #CISO #AI #DataGovernance #LegalTech #CdnTech</p>\n<p>Disclaimer: The views expressed in this post are my own and do not necessarily reflect the official policy or position of my employer. This commentary is based on publicly available information and is provided for informational purposes only. It does not constitute legal advice.</p>\n<p>Keyword: #OpenAI #ChatGPT #SDNY #JudgeStein #Discovery #eDiscovery #Privacy #Cybersecurity #Copyright #CopyrightLitigation #NYTimes #AIGovernance #AcceptableUse #ShadowAI #DataLeakage #DeIdentification #Anonymity #AttorneysEyesOnly #ProtectiveOrder #LegalProcess #Proportionality #DataRetention #RetentionPolicy #DataClassification #DLP #EnterpriseAI #RiskManagement #Compliance #Governance #CanadianTech #CrossBorderData #PrivacyByDesign #ReIdentification #Metadata #Confidentiality #LegalRisk</p>\n<img src=\"https://cdn.uploads.micro.blog/255457/2026/chatgpt-image-jan-6-2026-at-08-23-38-am.png\">",
        "date_published": "2026-01-06T11:23:00-04:00",
        "url": "https://kiledjian.com/2026/01/06/the-stein-standard-what-the.html",
        "tags": ["Artificial Intelligence","Cybersecurity \u0026 Privacy"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2026/01/04/french-authorities-investigate-ai-undressing.html",
        
        "content_html": "<p><a href=\"https://securityaffairs.com/186460/ai/french-authorities-investigate-ai-undressing-deepfakes-on-x.html\" target=\"_blank\" rel=\"noopener noreferrer\">French authorities investigate AI ‘undressing’ deepfakes on X</a></p>\n<p>French authorities are investigating AI-generated deepfakes on X after hundreds of women and teens reported non-consensual sexually explicit images created using the Grok chatbot. This investigation is part of an existing probe into X, with potential penalties including prison time and fines.</p>\n",
        "date_published": "2026-01-04T01:34:00-04:00",
        "url": "https://kiledjian.com/2026/01/04/french-authorities-investigate-ai-undressing.html",
        "tags": ["Artificial Intelligence","Cybersecurity \u0026 Privacy"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2026/01/02/the-per-cent-myth-why.html",
        "title": "The \"10 Per Cent\" Myth: Why AI Capability Does Not Equal a Pink Slip",
        "content_html": "<p>The headlines are everywhere, and they are designed to stop your scroll: &ldquo;AI to Replace 1/10 of the Workforce.&rdquo;</p>\n<p>It is a terrifying number. It represents millions of livelihoods reduced to a statistic. But as a chief information security officer, I do not deal in headlines. I deal in risk, audits and rigorous data analysis.</p>\n<p>When you strip away the hype and audit the primary sources released in late 2025—specifically from Project Iceberg (MIT), Yale and McKinsey—a completely different reality emerges.</p>\n<p>We are confusing technical exposure with actual displacement.</p>\n<p>Here is the fact-based reality of the AI labour market as we enter 2026.</p>\n<ol>\n<li>The Audit: Capability vs. Likelihood</li>\n</ol>\n<p>The viral &ldquo;10 per cent&rdquo; statistic stems from Project Iceberg (led by MIT and partners), published in November 2025. Researchers found that AI has the technical capability to automate tasks representing 11.7 per cent of the U.S. economy&rsquo;s wage value.</p>\n<p>In the world of risk assessment, however, capability is only half the equation. You must also calculate likelihood.</p>\n<p>Just because a task can be automated does not mean it will be today. History proves that the gap between technical feasibility and widespread adoption is measured in decades, not fiscal quarters. Cloud has been enterprise-viable for well over a decade, yet a vast portion of enterprise workloads remain on premises.</p>\n<p>The Reality: In 2025, while AI could theoretically perform the work of millions, announced job-cut plans explicitly attributed to AI totaled approximately 55,000 through November (Source: Challenger, Gray &amp; Christmas).</p>\n<p>The Context: That represents approximately 0.03 per cent of the U.S. labour force. The theoretical avalanche is, in practice, a statistical rounding error.</p>\n<ol start=\"2\">\n<li>The &ldquo;Zero Disruption&rdquo; Verdict</li>\n</ol>\n<p>If 10 per cent of jobs were vanishing, the macroeconomic data would be screaming. Instead, it is barely whispering.</p>\n<p>A comprehensive study by Yale University’s Budget Lab (October 2025) analyzed labour market data from the launch of ChatGPT in late 2022 through to late 2025. Their conclusion was blunt: &ldquo;No discernible disruption.&rdquo;</p>\n<p>Three years into the generative AI revolution, aggregate data shows stability, not collapse. We are not witnessing a displacement crisis; we are witnessing a retooling phase.</p>\n<ol start=\"3\">\n<li>It Is Not About Jobs—It Is About Tasks</li>\n</ol>\n<p>The most critical distinction lost in the media noise is the difference between a job and a task.</p>\n<p>A job is a complex bundle of responsibilities. Some are routine (data entry, scheduling, basic coding). Others require judgment, empathy, strategy and accountability. AI is exceptional at the former and still limited in the latter.</p>\n<p>McKinsey’s November 2025 analysis suggests that while over half of work hours are exposed to automation, this typically results in augmentation, not replacement. When AI automates 20 per cent of your routine tasks, you do not lose your job; you gain 20 per cent of your capacity back to focus on high-value work that algorithms cannot touch.</p>\n<ol start=\"4\">\n<li>The Hidden Risk: Geography</li>\n</ol>\n<p>While the media focuses on Silicon Valley, Project Iceberg reveals a &ldquo;hidden&rdquo; risk. The study distinguishes between &ldquo;Surface Index&rdquo; exposure (visible technology roles) and &ldquo;Hidden&rdquo; exposure (administrative and financial back-office roles).</p>\n<p>The data shows that states with heavy financial and administrative sectors—like Delaware in the U.S.—have higher theoretical exposure than pure technology hubs.</p>\n<p>The Canadian Implication: Applying this logic to Canada, the financial corridors of Toronto and our administrative centres likely face higher exposure than our tech hubs. This suggests the transition will be a slow, quiet evolution of white-collar workflows, not a sudden &ldquo;tech bubble&rdquo; burst.</p>\n<p>The Bottom Line</p>\n<p>Is the labour market changing? Absolutely. Is 10 per cent of the workforce being replaced tomorrow? The data says no.</p>\n<p>The 11.7 per cent figure is a map of exposure, not a forecast of unemployment. It tells us what could change, not what is changing next Tuesday.</p>\n<p>The risk isn&rsquo;t that AI will take your job overnight. The risk is failing to learn the tools that will define the next decade. As professionals, we need to move past the fear of replacement and focus on fluency.</p>\n<p>Map tasks, not titles. Measure adoption, not headlines.</p>\n<p>Data sources: Project Iceberg/MIT (Nov. 2025); Yale Budget Lab (Oct. 2025); McKinsey Global Institute (Nov. 2025); Challenger, Gray &amp; Christmas (2025).</p>\n<p>Disclaimer &amp; Ethics Statement: This article was drafted with the assistance of AI tools to synthesize large datasets from the cited reports (MIT, Yale, McKinsey). All data points, logic and conclusions were independently audited and verified by the human author.<br>\nThe content provided here is for informational purposes only and does not constitute career or financial advice.<br>\nThe views expressed are my own and do not necessarily reflect the official policy or position of my employer.</p>\n<p>#AI #FutureOfWork #CISO #RiskManagement #CanadianBusiness</p>\n<p>Keywords: #AI #ArtificialIntelligence #FutureOfWork #WorkforceTransformation #DigitalTransformation #Productivity #Automation #Augmentation #Jobs #Skills #Reskilling #Upskilling #Leadership #Strategy #Innovation #RiskManagement #CISO #CyberSecurity #Governance #Compliance #Audit #DataDriven #EvidenceBased #TechPolicy #LabourMarket #EconomicTrends #Workplace #Operations #ChangeManagement #Canada #CanadianBusiness #Toronto #McKinsey #MIT #Yale</p>\n<img src=\"https://cdn.uploads.micro.blog/255457/2026/chatgpt-image-jan-2-2026-at-09-30-32-am.png\">",
        "date_published": "2026-01-02T12:30:00-04:00",
        "url": "https://kiledjian.com/2026/01/02/the-per-cent-myth-why.html",
        "tags": ["Artificial Intelligence","Leadership \u0026 Mindset"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2025/12/23/the-delete-button-is-a.html",
        "title": "The ‘Delete’ Button Is a Lie: A Canadian’s Guide to AI Data Retention  ",
        "content_html": "<p>When you hit &ldquo;delete&rdquo; on a conversation with ChatGPT or Gemini, you likely expect it to vanish. In reality, that data often enters a digital limbo—accessible to the provider for 30 days, three years, or even seven years for certain safety-classifier metadata, depending on the fine print you didn&rsquo;t read.</p>\n<p>For paid subscribers, the assumption of privacy is dangerous. While corporate &ldquo;Team&rdquo; and &ldquo;Enterprise&rdquo; plans typically offer stronger contractual controls (including training restrictions and admin-managed retention), &ldquo;Pro&rdquo; and &ldquo;Plus&rdquo; users are frequently treated as consumers with slightly better perks, not better privacy.</p>\n<p>Here is the verified reality of data deletion for the four major large language models (LLMs) available in Canada.</p>\n<h2 id=\"chatgpt-openai\">ChatGPT (OpenAI)</h2>\n<p><strong>The Plans:</strong> Free, Plus and Pro (personal workspaces)<br>\n<strong>The Default:</strong> <strong>Opt-out required.</strong> OpenAI enables data sharing by default for these tiers. Unless you opt out, your conversations can be used to train future models.</p>\n<p><strong>The Reality:</strong><br>\nOpenAI deletes conversations from its systems within <strong>30 days</strong> of you deleting them. However, this is not absolute. OpenAI explicitly states that data may be retained longer if required by law—a significant caveat given 2025’s litigation landscape involving copyright and data usage.</p>\n<p><strong>The Catch:</strong></p>\n<ul>\n<li><strong>Legal Holds:</strong> If your account is subject to a preservation order, &ldquo;deleted&rdquo; data may be archived until the legal matter resolves. For example, during 2025 copyright litigation, a preservation order required the retention of certain consumer data between April and September; OpenAI later stated the order ended Sept. 26, 2025, with limited historical data retained under secure hold.</li>\n<li><strong>Temporary Chat:</strong> Using the &ldquo;Temporary Chat&rdquo; toggle prevents the conversation from appearing in your history, but OpenAI retains these chats for up to 30 days specifically to monitor for abuse.</li>\n<li><strong>Training vs. Retention:</strong> Deleting a chat <em>after</em> it has been used to train the model does not untrain the model.</li>\n</ul>\n<p><strong>Your Move:</strong> Go to <strong>Settings &gt; Data Controls</strong> and toggle &ldquo;Improve the model for everyone&rdquo; to <strong>OFF</strong>. This is the primary way to ensure your future chats are not ingested into the &ldquo;brain&rdquo; of future GPT versions.</p>\n<h2 id=\"claude-anthropic\">Claude (Anthropic)</h2>\n<p><strong>The Plan:</strong> Claude Pro<br>\n<strong>The Default:</strong> <strong>Opt-out required.</strong> In a policy update announced Aug. 28, 2025 (with an Oct. 8 decision deadline for existing users), Anthropic introduced specific provisions for training data retention.</p>\n<p><strong>The Reality:</strong><br>\nIf you allow Anthropic to use your data for model improvement, your conversations may be retained for up to <strong>five years</strong> in their training pipelines. If you opt out, deleted conversations are removed from backend systems within 30 days.</p>\n<p><strong>The Catch:</strong></p>\n<ul>\n<li><strong>The 5-Year Pipeline:</strong> The five-year retention applies to data used for &ldquo;benchmarking and model improvement.&rdquo; If you missed the notification to opt out, your historical data may already be in this pipeline.</li>\n<li><strong>Safety &amp; Compliance:</strong> Even if you opt out of training, Anthropic retains data flagged by its Trust &amp; Safety classifiers for up to <strong>two years</strong>. Critical safety data, such as &ldquo;classifier scores&rdquo; (metadata about <em>why</em> a prompt was flagged), can be kept for up to <strong>seven years</strong>.</li>\n</ul>\n<p><strong>Your Move:</strong> Go to <strong>Settings &gt; Privacy</strong> immediately and ensure the &ldquo;Help improve Claude&rdquo; toggle is turned <strong>OFF</strong>.</p>\n<h2 id=\"gemini-google\">Gemini (Google)</h2>\n<p><strong>The Plan:</strong> Gemini Advanced (Google One AI Premium)<br>\n<strong>The Default:</strong> <strong>18-month retention.</strong> By default, Google retains your Gemini Apps Activity for 18 months, similar to your Search history.</p>\n<p><strong>The Reality:</strong><br>\nYou can change your auto-delete setting to 3 months or delete individual chats manually. However, Google’s backend processing creates persistent copies. Even if you turn &ldquo;Gemini Apps Activity&rdquo; <strong>OFF</strong> entirely, Google retains conversations for up to <strong>72 hours</strong> to maintain service continuity and process feedback.</p>\n<p><strong>The Catch:</strong></p>\n<ul>\n<li><strong>The Human Review Trap:</strong> This is the most critical risk. Google disconnects specific chats to be read by human reviewers. Once a chat is selected for review, it is &ldquo;disconnected&rdquo; (disassociated) from your account and retained for up to <strong>three years</strong>.</li>\n<li><strong>Irreversible:</strong> Because these reviewed chats are technically separated from your user ID, deleting the original conversation from your history does <em>not</em> delete the copy held by the human review team.</li>\n</ul>\n<p><strong>Your Move:</strong> Go to **<a href=\"https://myactivity.google.com/product/gemini**.\" target=\"_blank\" rel=\"noopener noreferrer\">myactivity.google.com/product/g&hellip;</a> Set the Auto-delete option to <strong>3 months</strong> (the minimum) and strictly avoid putting sensitive identifiers in your prompts.</p>\n<h2 id=\"grok-xai\">Grok (xAI)</h2>\n<p><strong>The Plan:</strong> Grok Premium (X Premium)<br>\n<strong>The Default:</strong> <strong>Verify your settings.</strong> xAI’s consumer policy allows for model training unless you intervene.</p>\n<p><strong>The Reality:</strong><br>\nGrok offers a &ldquo;Private Chat&rdquo; mode (often indicated by a ghost icon or distinct toggle) which is intended to be ephemeral. Standard chats (non-private) may be used for training. xAI states that deleted data is removed from accessible systems within 30 days.</p>\n<p><strong>The Catch:</strong></p>\n<ul>\n<li><strong>The Feedback Loop:</strong> Even if you opt out of general training, xAI notes that if you voluntarily submit feedback (like rating a response), that specific data may still be used for model improvement.</li>\n<li><strong>Platform overlap:</strong> If you access Grok via X (formerly Twitter), your data handling is governed by X’s broader privacy terms, which can differ from xAI’s standalone app policies.</li>\n</ul>\n<p><strong>Your Move:</strong> You have two options for privacy: exclusively use &ldquo;Private Chat,&rdquo; or verify your &ldquo;Data Sharing&rdquo; settings (typically found under Privacy &amp; Safety on X) to ensure you have unchecked the box allowing your data to be used for model training.</p>\n<h2 id=\"summary-the-safe-deletion-window\">Summary: The ‘Safe’ Deletion Window</h2>\n<ul>\n<li><strong>Claude:</strong> Deleted conversations are removed from backend systems within 30 days. <em>Risk:</em> <strong>Seven-year retention</strong> for safety classifier scores; five years for training-pipeline data if you do not opt out.</li>\n<li><strong>ChatGPT:</strong> Takes 30 days to delete. <em>Risk:</em> &ldquo;Temporary&rdquo; chats are still monitored for 30 days; legal holds can override deletion.</li>\n<li><strong>Grok:</strong> Takes 30 days to delete. <em>Risk:</em> Voluntary feedback can be used for model improvement even if you opt out of general training.</li>\n<li><strong>Gemini:</strong> Auto-delete can be set to 3, 18 or 36 months (user setting). <em>Risk:</em> Human-reviewed data is kept for <strong>three years</strong> and cannot be deleted by the user.</li>\n</ul>\n<h2 id=\"final-advice-for-canadian-users\">Final Advice for Canadian Users</h2>\n<p>While the <em>Personal Information Protection and Electronic Documents Act</em> (PIPEDA) imposes accountability standards on how companies handle Canadian data, it does not prevent cross-border processing. In practice, once your data sits on a server in Oregon or Iowa, U.S. legal frameworks—and subpoenas—may compel disclosure, even where Canadian expectations differ.</p>\n<p>For absolute security, the data must never leave your device. If you must use cloud AI, assume that &ldquo;Deleted&rdquo; actually means &ldquo;Archived for 30 days,&rdquo; and plan accordingly.</p>\n<h2 id=\"ethics-statement--disclaimer\">Ethics Statement &amp; Disclaimer</h2>\n<p><strong>Ethics Statement:</strong> This article is editorial content. The author has no financial relationship with OpenAI, Anthropic, Google or xAI. No company paid to be included in this post, nor did they review the content prior to publication. I personally subscribe to these services to test them objectively.</p>\n<p><strong>Disclaimer:</strong> The information in this post is based on terms of service and privacy policies available as of Dec. 23, 2025. AI companies frequently update their data retention policies without direct notification. The steps provided above are accurate at the time of writing but may change. This post is for informational purposes only and does not constitute legal or professional advice. Readers should consult their organization&rsquo;s legal or security teams before using consumer AI tools for sensitive work.</p>\n<p>Keywords: #AI #DataPrivacy #Cybersecurity #InfoSec #Privacy #DataRetention #DigitalPrivacy #PIPEDA #Canada #Compliance #RiskManagement #SecurityAwareness #DataProtection #CloudSecurity #AIRegulation #TrustAndSafety #LLM #ChatGPT #ClaudeAI #GoogleGemini #Grok #xAI #OpenAI #Anthropic #Google #PrivacyByDesign #Governance #GRC #SecurityPolicy #DataGovernance #CyberRisk #TechPolicy #PrivacyTech #DigitalRights #InfoPrivacy</p>",
        "date_published": "2025-12-23T12:37:00-04:00",
        "url": "https://kiledjian.com/2025/12/23/the-delete-button-is-a.html",
        "tags": ["Artificial Intelligence","Cybersecurity \u0026 Privacy"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2025/12/22/chinas-open-ai-models-are.html",
        "title": "China's open AI models are in a dead heat with the West",
        "content_html": "<p>China&rsquo;s open AI models are in a dead heat with the West - here&rsquo;s what happens\nnext\n<a href=\"https://www.zdnet.com/article/china-open-ai-models-versus-us-llms-power-performance-compared/\" target=\"_blank\" rel=\"noopener noreferrer\">www.zdnet.com/article/c&hellip;</a></p>\n<p>With the rising technological prowess and greater openness of Chinese models,\nthe world is increasingly turning to the East for efficient and customizable\nAI, a new report finds.</p>\n<p>ZDNET&rsquo;s key takeaways:</p>\n<ul>\n<li>Chinese AI models have caught up to US models in power and performance.</li>\n<li>China is leading in model openness.</li>\n<li>Much of the world may adopt the freely available Chinese technology.</li>\n</ul>\n",
        "date_published": "2025-12-22T11:17:00-04:00",
        "url": "https://kiledjian.com/2025/12/22/chinas-open-ai-models-are.html",
        "tags": ["Artificial Intelligence"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2025/12/19/coursera-to-buy-udemy-creating.html",
        
        "content_html": "<p><a href=\"https://www.reuters.com/business/coursera-udemy-merge-deal-valuing-combined-firm-25-billion-2025-12-17/\" target=\"_blank\" rel=\"noopener noreferrer\">Coursera to buy Udemy, creating $2.5 billion firm to target AI training | Reuters</a></p>\n<p>Coursera announced an all-stock deal to acquire Udemy, valuing the combined company at $2.5 billion. The merger aims to strengthen their position in corporate workforce training, particularly in AI, data science, and software development. The deal is expected to close in the second half of next year, pending regulatory and shareholder approvals.</p>\n",
        "date_published": "2025-12-19T17:13:00-04:00",
        "url": "https://kiledjian.com/2025/12/19/coursera-to-buy-udemy-creating.html",
        "tags": ["Artificial Intelligence","Technology \u0026 Business"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2025/12/19/managing-agentic-ai-risk-lessons.html",
        
        "content_html": "<p><a href=\"https://www.csoonline.com/article/4109123/managing-agentic-ai-risk-lessons-from-the-owasp-top-10.html\" target=\"_blank\" rel=\"noopener noreferrer\">Managing agentic AI risk: Lessons from the OWASP Top 10 | CSO Online</a></p>\n<p>The OWASP Top 10 for Agentic AI provides a framework to address the growing security risks associated with agentic AI adoption, offering practical guidance, threat taxonomies, and mitigation strategies for CISOs. While the list is immediately useful, some areas like detailed mitigation steps and attack likelihood require further development.</p>\n",
        "date_published": "2025-12-19T09:51:00-04:00",
        "url": "https://kiledjian.com/2025/12/19/managing-agentic-ai-risk-lessons.html",
        "tags": ["Artificial Intelligence","Cybersecurity \u0026 Privacy"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2025/12/15/microsoft-scales-back-ai-goals.html",
        
        "content_html": "<p><a href=\"https://www.extremetech.com/computing/microsoft-scales-back-ai-goals-because-almost-nobody-is-using-copilot\" target=\"_blank\" rel=\"noopener noreferrer\">Microsoft Scales Back AI Goals Because Almost Nobody Is Using Copilot | Extremetech</a></p>\n<p>Microsoft has reportedly scaled back AI goals for its Copilot software due to low user adoption and sales, with some targets cut by 50%. While Microsoft disputes the sales quota claims, AI agents have shown low success rates in tasks, and Copilot lags behind competitors like ChatGPT and Google&rsquo;s Gemini in market share.</p>\n",
        "date_published": "2025-12-15T09:44:00-04:00",
        "url": "https://kiledjian.com/2025/12/15/microsoft-scales-back-ai-goals.html",
        "tags": ["Artificial Intelligence","Technology \u0026 Business"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2025/12/13/i-tested-chatgpt-vs-gemini.html",
        
        "content_html": "<p><a href=\"https://www.tomsguide.com/ai/i-tested-chatgpt-5-2-vs-gemini-3-0-with-7-real-world-prompts-heres-the-winner\" target=\"_blank\" rel=\"noopener noreferrer\">I tested ChatGPT-5.2 vs Gemini 3.0 with 7 real-world prompts — here&rsquo;s the winner | Tom&rsquo;s Guide</a></p>\n<p>In a comparison of ChatGPT-5.2 and Gemini 3.0 across seven real-world prompts, ChatGPT-5.2 emerged as the overall winner, demonstrating superior emotional intelligence and psychological insight in its responses. While Gemini 3.0 excelled in specific areas like risk assessment and technical explanations, ChatGPT-5.2 consistently provided more human-like, wise, and grounding answers.</p>\n",
        "date_published": "2025-12-13T02:23:00-04:00",
        "url": "https://kiledjian.com/2025/12/13/i-tested-chatgpt-vs-gemini.html",
        "tags": ["Artificial Intelligence"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2025/12/02/autonomously-finding-ffmpeg-vulnerabilities-with.html",
        
        "content_html": "<p><a href=\"https://zeropath.com/blog/autonomously-finding-7-ffmpeg-vulnerabilities-with-ai-2025\" target=\"_blank\" rel=\"noopener noreferrer\">Autonomously Finding 7 FFmpeg Vulnerabilities With AI - ZeroPath Blog | ZeroPath</a></p>\n<p>This document details seven vulnerabilities found in FFmpeg, including buffer overflows and invalid frees, stemming from issues like integer truncation, unbounded serialization, off-by-one errors, and incorrect stream indexing. ZeroPath&rsquo;s AI SAST identified these by analyzing allocation and copy alignment, framing invariants, packet builder capacities, cardinality propagation, and offset arithmetic integrity, often bypassing limitations of traditional fuzzers and static analysis tools.</p>\n",
        "date_published": "2025-12-02T20:47:00-04:00",
        "url": "https://kiledjian.com/2025/12/02/autonomously-finding-ffmpeg-vulnerabilities-with.html",
        "tags": ["Artificial Intelligence","Cybersecurity \u0026 Privacy"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2025/12/02/poetry-can-trick-ai-models.html",
        
        "content_html": "<p><a href=\"https://www.the-independent.com/tech/ai-model-chatgpt-poetry-nuclear-weapons-b2875452.html\" target=\"_blank\" rel=\"noopener noreferrer\">Poetry can trick AI models like ChatGPT into revealing how to make nuclear weapons, study finds | The Independent</a></p>\n<p>A new study reveals that poetry-based prompts can trick AI models like ChatGPT into bypassing safety features and revealing instructions for creating malware or nuclear weapons. This method, termed adversarial poetry, successfully circumvented controls in major AI models, with poetic prompts leading to a significantly higher rate of unsafe replies compared to prose.</p>\n",
        "date_published": "2025-12-02T20:46:00-04:00",
        "url": "https://kiledjian.com/2025/12/02/poetry-can-trick-ai-models.html",
        "tags": ["Artificial Intelligence","Cybersecurity \u0026 Privacy"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2025/12/02/australia-abandons-proposed-mandatory-ai.html",
        
        "content_html": "<p><a href=\"https://www.govinfosecurity.com/blogs/australia-abandons-proposed-mandatory-ai-rules-in-new-plan-p-3986\" target=\"_blank\" rel=\"noopener noreferrer\">Australia Abandons Proposed Mandatory AI Rules in New Plan</a></p>\n<p>Australia has shifted from proposed mandatory AI rules to a voluntary framework, opting for existing laws on privacy and copyright instead of new AI-specific legislation. This decision has been met with support from business groups but criticism from academics and the Greens, who argue it lacks enforcement and adequate investment compared to international approaches.</p>\n",
        "date_published": "2025-12-02T20:43:00-04:00",
        "url": "https://kiledjian.com/2025/12/02/australia-abandons-proposed-mandatory-ai.html",
        "tags": ["Artificial Intelligence","Technology \u0026 Business"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2025/12/02/canada-launches-first-register-of.html",
        
        "content_html": "<p><a href=\"https://www.canada.ca/en/treasury-board-secretariat/news/2025/11/canada-launches-first-register-of-ai-uses-in-federal-government.html\" target=\"_blank\" rel=\"noopener noreferrer\">Canada launches first register of AI uses in federal government - Canada.ca</a></p>\n<p>Canada has launched its first public AI Register to detail how artificial intelligence is used within the federal government, marking a key step in the public services AI Strategy. The register currently lists over400 AI systems across42 institutions and will undergo public consultations in2026 for refinement.</p>\n",
        "date_published": "2025-12-02T08:11:00-04:00",
        "url": "https://kiledjian.com/2025/12/02/canada-launches-first-register-of.html",
        "tags": ["Artificial Intelligence","Technology \u0026 Business"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2025/11/28/improving-ai-outcomes-through-better.html",
        "title": "Improving AI Outcomes Through Better Prompting",
        "content_html": "<p>AI is becoming integral to how many of us work, but too often the results still feel generic or misaligned. A small shift in how we prompt these systems can dramatically improve the quality, clarity and usefulness of their responses.<br>\nBy asking the AI to seek clarification before answering, we eliminate assumptions and get far stronger outputs.</p>\n<p>Most people continue to use AI tools as if they were search engines: ask a question once and expect a complete answer. The challenge is that large language models are trained to fill gaps when faced with ambiguity. Research from the University of Washington and Stanford shows that when prompts lack detail, LLMs tend to infer the most likely meaning instead of checking for accuracy.</p>\n<p>A simple adjustment solves this. Adding one line to a prompt—“Ask me clarifying questions until you are at least 95 per cent confident you understand what I need”—encourages the AI to slow down, confirm the context and deliver more precise results. It transforms a one-way query into a more thoughtful exchange.</p>\n<p>This technique has been validated across several domains. In customer support, a study involving more than five thousand agents found that clarification-capable AI increased resolved tickets per hour by up to 14 per cent. Broader enterprise research from McKinsey continues to show potential productivity gains of 30 to 45 per cent when generative AI is properly deployed within real workflows.</p>\n<p>In financial and legal settings, Bloomberg’s financial LLM improved accuracy in several reporting tasks by nearly 50 per cent when clarification routines were added. In software development, Adyen has documented stronger consistency and reduced rework by using AI that asks targeted questions before generating tests. Even in education, research shows that Socratic-style clarification leads to better critical thinking and stronger learning outcomes.</p>\n<p>The principle is simple: better questions drive better answers. When the AI verifies its understanding, it reduces misinterpretation, sharpens relevance and produces work that aligns more closely with what you actually need.</p>\n<p>Here’s an easy example to try:</p>\n<blockquote>\n<p>“Write a summary of this document. Before you begin, ask me any clarifying questions until you are 95 per cent confident you can complete this accurately.”</p>\n</blockquote>\n<p>Instead of guessing, the AI will ask about the intended audience, tone, length, purpose and constraints. Your responses give it the context required to produce something clear, accurate and fit for purpose.</p>\n<p>As we continue incorporating AI into our daily workflows, techniques like this ensure that the technology works with us, not around us. Encouraging the AI to ask questions first leads to outputs that are more thoughtful, more targeted and ultimately more useful.</p>\n<p>If you test this approach, I’d be interested in what you learn—successes, missteps and everything in between. The evolution of AI-assisted work is a shared journey, and the way we prompt these systems matters more than we think.</p>\n<p>#ai #promptengineering #clarificationprompting #askbeforeanswer #generativeai #llm #aitools #aiproductivity #enterpriseai #futureofwork #digitaltransformation #aistrategy #consultativeai #aiworkflows #betterprompts #aiinnovation #leadership #technologyleadership #ciso #cybersecurity #datastrategy #intelligentautomation #knowledgework #aiinsights #aiadoption #smarterai #askbetterquestions #aiinbusiness #businessinnovation #thoughtleadership #aipractices #collaborativeai #thinkingassistants #worksmarter</p>\n<img src=\"https://cdn.uploads.micro.blog/255457/2025/chatgpt-image-nov-28-2025-at-11-13-44-am.png\">",
        "date_published": "2025-11-28T13:14:00-04:00",
        "url": "https://kiledjian.com/2025/11/28/improving-ai-outcomes-through-better.html",
        "tags": ["Artificial Intelligence"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2025/11/05/comprehensive-analysis-of-leading-ai.html",
        "title": "Comprehensive analysis of leading AI models in 2025: strengths, weaknesses and standout capabilities",
        "content_html": "<p>The artificial-intelligence landscape in 2025 has evolved into a highly competitive arena where numerous models offer distinct advantages for specific use cases. This article examines publicly available AI models shaping the industry, summarizing where each excels and where limitations remain.</p>\n<h2 id=\"executive-snapshot-what-each-model-does-best\">Executive snapshot: what each model does best</h2>\n<p><strong>ChatGPT (GPT-5, GPT-4.5, GPT-4o):</strong> best generalist for agentic workflows, multi-step coding and polished consumer experiences<br>\n<strong>Grok 3/4 (xAI):</strong> strongest for real-time, web-aware analysis with extended reasoning and STEM tasks<br>\n<strong>Claude Sonnet 4.5 (Anthropic):</strong> leading coding model with hybrid reasoning and sustained autonomous operation claims<br>\n<strong>Gemini 2.5 Pro (Google):</strong> native multimodality with ultra-long context (one to two million tokens) for cross-modal comprehension<br>\n<strong>Kimi K2 (Moonshot):</strong> trillion-parameter mixture-of-experts model with strong coding claims and cost efficiency<br>\n<strong>Qwen 3 235B (Alibaba):</strong> hybrid-reasoning with switchable thinking modes and extensive multilingual support<br>\n<strong>DeepSeek R1:</strong> open-reasoning model with transparent methodology and strong math and code performance<br>\n<strong>Llama 4 Maverick (Meta):</strong> natively multimodal open-weight model with favourable performance-to-cost ratio<br>\n<strong>Mistral Large / Medium 3:</strong> efficient European multilingual model optimised for coding and pragmatic enterprise pricing<br>\n<strong>Hermes 4 (Nous Research):</strong> open-weight hybrid reasoning with transparent thinking traces and minimal content restrictions</p>\n<h2 id=\"openai-chatgpt-gpt-4o-gpt-45-gpt-5-and-o-series\">OpenAI ChatGPT (GPT-4o, GPT-4.5, GPT-5 and o-series)</h2>\n<h3 id=\"overview\">Overview</h3>\n<p>OpenAI maintains a multi-model strategy under the ChatGPT umbrella. GPT-4o is a multimodal generalist, GPT-4.5 emphasises conversational polish and GPT-5 is the flagship. The o-series (o1, o3, o3-mini) specialise in complex reasoning.</p>\n<h3 id=\"key-strengths\">Key strengths</h3>\n<p><strong>GPT-4o:</strong><br>\nMultimodal input (text, image, voice) with near-human response times<br>\n128,000-token context window<br>\nImproved compute efficiency<br>\nStrong general-purpose performance<br>\nEnhanced vision capabilities</p>\n<p><strong>GPT-4.5:</strong><br>\nMore natural conversational tone than GPT-4o<br>\nBetter sentiment detection and social-cue awareness<br>\nReduced hallucinations (~61.8 per cent to ~37.1 per cent)<br>\nSuitable for creative and nuanced writing</p>\n<p><strong>GPT-5:</strong><br>\nReleased Aug. 7 2025<br>\nClaims state-of-the-art performance across coding, mathematics, writing and vision<br>\nMore unified operation with fast and deep-thinking modes<br>\nImproved reasoning for complex problem solving</p>\n<p><strong>O-series (o1, o3):</strong><br>\nExcels at scientific, mathematical and coding-based reasoning<br>\nUses chain-of-thought logic to outperform GPT-4o on deep analyses</p>\n<h3 id=\"key-weaknesses\">Key weaknesses</h3>\n<p><strong>GPT-4o:</strong><br>\nWeaker at abstract-reasoning, analogy, pattern recognition and spatial tasks<br>\nChallenges interpreting multi-speaker emotional nuance<br>\nStruggles with extended logic and very long code chains</p>\n<p><strong>GPT-4.5:</strong><br>\nLess explicit step-by-step logic than o-series<br>\nNo default Voice Mode, video processing or screen-sharing<br>\nExpected retirement from the API July 2025<br>\nStill mis-reasons in some cases (for example, letter counting)</p>\n<p><strong>O-series:</strong><br>\nSlower responses and higher cost<br>\nDoes not always express uncertainty<br>\nSome ChatGPT features unavailable in lower tiers<br>\nMessage caps in certain subscriptions</p>\n<h3 id=\"best-use-cases\">Best use cases</h3>\n<p>GPT-4o suits fast multimodal consumer interactions and creative content. GPT-4.5 fits creative writing, branding and emotionally nuanced tasks. GPT-5 supports complex engineering, agentic workflows and high-stakes problem solving. O-series models suit researchers, mathematicians and developers requiring explicit reasoning chains.</p>\n<h2 id=\"anthropic-claude-sonnet-45-opus-41\">Anthropic Claude (Sonnet 4.5, Opus 4.1)</h2>\n<h3 id=\"overview-1\">Overview</h3>\n<p>Anthropic’s Claude 4 family emphasises safer responses, long-context comprehension and strong coding performance. Sonnet 4.5 is promoted as the top coding model; Opus 4.1 focuses on advanced reasoning.</p>\n<h3 id=\"key-strengths-1\">Key strengths</h3>\n<p><strong>Claude Sonnet 4.5:</strong><br>\nReported 77.2 per cent on SWE-bench Verified (82.0 per cent with high compute)<br>\n61.4 per cent on OSWorld for computer-use tasks<br>\nClaims of 30-plus hours of autonomous coding<br>\n100 per cent score on AIME 2025 using Python tools (87 per cent without)<br>\n83.4 per cent on GPQA Diamond<br>\nStrong alignment and low power-seeking behaviour</p>\n<p><strong>Claude Opus 4.1:</strong><br>\nUp to 30 hours of autonomous operation<br>\nStrong multi-document and instruction following performance<br>\nBetter suited for analytical accuracy and specialised workflows</p>\n<h3 id=\"key-weaknesses-1\">Key weaknesses</h3>\n<p><strong>Sonnet 4.5:</strong><br>\nMore cautious tone; sometimes over-hedges<br>\nVisual-reasoning (77.8 per cent MMMU) trails GPT-5 (84.2 per cent) and Gemini 2.5 Pro (82.0 per cent)<br>\nSafety classifiers can flag benign content</p>\n<p><strong>Opus 4.1:</strong><br>\nRoughly five times the cost of Sonnet 4.5<br>\nInferior for software-development work<br>\nHigher latency</p>\n<h3 id=\"best-use-cases-1\">Best use cases</h3>\n<p>Sonnet 4.5 is strong for software development, debugging, testing and agent workflows. Opus 4.1 suits legal, finance and research tasks where accuracy justifies higher cost.</p>\n<h2 id=\"xai-grok-grok-3-grok-4\">xAI Grok (Grok 3, Grok 4)</h2>\n<h3 id=\"overview-2\">Overview</h3>\n<p>xAI, founded by Elon Musk, introduced Grok 3 in February 2025 and Grok 4 on July 9 2025. Both models emphasise long-context reasoning and real-time web-awareness through X.</p>\n<h3 id=\"key-strengths-2\">Key strengths</h3>\n<p><strong>Grok 3:</strong><br>\nStrong on advanced math and STEM reasoning<br>\n128,000-token context window<br>\n“Think Mode” enables step-by-step reasoning<br>\n“DeepSearch” enables real-time content analysis</p>\n<p><strong>Grok 4:</strong><br>\nAdds multi-agent reasoning<br>\nAvailable in a developer-focused subscription tier<br>\nMaintains long-context capability</p>\n<h3 id=\"key-weaknesses-2\">Key weaknesses</h3>\n<p>Mixed output consistency<br>\nHigher hallucination risk than rivals<br>\nReal-time access is not universally guaranteed<br>\nPremium pricing for developer tiers</p>\n<h3 id=\"best-use-cases-2\">Best use cases</h3>\n<p>Advanced mathematics, STEM workflows, research leveraging real-time context and X-integrated environments.</p>\n<h2 id=\"google-gemini-25-pro\">Google Gemini (2.5 Pro)</h2>\n<h3 id=\"overview-3\">Overview</h3>\n<p>Google DeepMind’s Gemini 2.5 Pro emphasises multimodal reasoning, ultra-long context and enterprise-ready integration.</p>\n<h3 id=\"key-strengths-3\">Key strengths</h3>\n<p>Strong performance across math and science tasks<br>\nNative multimodal: text, image and video<br>\nUp to one million-token context, roadmap to two million<br>\nStrong cross-modal comprehension</p>\n<h3 id=\"key-weaknesses-3\">Key weaknesses</h3>\n<p>Not as strong in coding or agentic workflows<br>\nSome reported factuality issues<br>\nBenchmarks for code remain mixed</p>\n<h3 id=\"best-use-cases-3\">Best use cases</h3>\n<p>Large-scale document analysis, multimedia reasoning and long-context enterprise workflows.</p>\n<h2 id=\"deepseek-r1\">DeepSeek R1</h2>\n<h3 id=\"overview-4\">Overview</h3>\n<p>DeepSeek R1, launched January 2025, prioritises transparent reasoning, open licensing and efficiency.</p>\n<h3 id=\"key-strengths-4\">Key strengths</h3>\n<p>Open MIT licence<br>\nTransparent reasoning traces<br>\nStrong math and coding performance<br>\nEfficient 37 billion active parameter design</p>\n<h3 id=\"key-weaknesses-4\">Key weaknesses</h3>\n<p>Shorter context window (~130,000)<br>\nPrimarily text-based; vision requires add-ons<br>\nWeaker usability and ecosystem support</p>\n<h3 id=\"best-use-cases-4\">Best use cases</h3>\n<p>Open-source deployments requiring transparent logic, math strength and flexible infrastructure.</p>\n<h2 id=\"meta-llama-4-maverick\">Meta Llama 4 Maverick</h2>\n<h3 id=\"overview-5\">Overview</h3>\n<p>Meta’s Llama 4 family arrived April 2025, featuring Scout and Maverick variants and providing multimodal capability in an open-weight model.</p>\n<h3 id=\"key-strengths-5\">Key strengths</h3>\n<p><strong>Llama 4 Maverick:</strong><br>\nCompetitive performance with favourable cost<br>\nOpen-weight for on-prem or private-cloud deployment<br>\nMultimodal training across text, image and video</p>\n<p><strong>Llama 4 Scout:</strong><br>\nClaims up to 10-million-token context<br>\nHigh cost-efficiency with fewer active parameters</p>\n<h3 id=\"key-weaknesses-5\">Key weaknesses</h3>\n<p>Variant confusion (benchmarks on tuned vs released weights)<br>\nLicensing needed for 700-million-plus monthly active-user services<br>\nEcosystem still maturing</p>\n<h3 id=\"best-use-cases-5\">Best use cases</h3>\n<p>Maverick supports coding, enterprise document analysis, multilingual reasoning and cost-sensitive deployment. Scout suits ultra-long-context tasks.</p>\n<h2 id=\"alibaba-qwen-3-235b\">Alibaba Qwen 3 235B</h2>\n<h3 id=\"overview-6\">Overview</h3>\n<p>Qwen 3, released April 2025, targets hybrid reasoning, broad multilingual coverage and open developer frameworks.</p>\n<h3 id=\"key-strengths-6\">Key strengths</h3>\n<p>Switchable reasoning vs fast modes<br>\nSupport across 119 languages<br>\nApache 2.0 licensing<br>\nCompetitive math and code results</p>\n<h3 id=\"key-weaknesses-6\">Key weaknesses</h3>\n<p>Earlier-stage ecosystem outside Asia<br>\nTool-use integrations less mature<br>\nArchitecture complexity adds integration overhead</p>\n<h3 id=\"best-use-cases-6\">Best use cases</h3>\n<p>Multilingual and open-source deployments, research requiring reasoning-depth control and flexible licensing.</p>\n<h2 id=\"mistral-large-medium-3\">Mistral (Large, Medium 3)</h2>\n<h3 id=\"overview-7\">Overview</h3>\n<p>Mistral AI offers both open-weight and enterprise models. Medium 3 emphasises cost-efficiency; Mistral Large aims at enterprise reasoning.</p>\n<h3 id=\"key-strengths-7\">Key strengths</h3>\n<p><strong>Medium 3:</strong><br>\nReported &gt;90 per cent of Claude Sonnet 3.7 performance at far lower cost<br>\nFavourable pricing<br>\nDeployable across most clouds<br>\nStrong coding and STEM capability</p>\n<p><strong>Mistral Large:</strong><br>\nEnterprise-grade multilingual support<br>\nNative function-calling and constrained output<br>\n32,000-token context</p>\n<h3 id=\"key-weaknesses-7\">Key weaknesses</h3>\n<p>Creative writing trails specialist models<br>\nOccasional multi-step spatial-reasoning issues<br>\nLanguage variation by region<br>\nSome benchmarks favour rivals</p>\n<h3 id=\"best-use-cases-7\">Best use cases</h3>\n<p>Medium 3 fits cost-efficient enterprise coding and document understanding. Mistral Large suits multilingual enterprise deployments requiring more depth.</p>\n<h2 id=\"nous-research-hermes-4\">Nous Research Hermes 4</h2>\n<h3 id=\"overview-8\">Overview</h3>\n<p>Released August 2025, Hermes 4 prioritises hybrid reasoning, minimal content restriction and transparent output.</p>\n<h3 id=\"key-strengths-8\">Key strengths</h3>\n<p>Toggle between fast and step-wise reasoning<br>\nOpen-weight release with full reasoning traces<br>\nStrong reported math scores<br>\nLength-control methods reduce over-generation</p>\n<h3 id=\"key-weaknesses-8\">Key weaknesses</h3>\n<p>High compute overhead for training and use<br>\nSmaller variants may overthink<br>\nMinimal filtering may not fit high-compliance industries<br>\nEcosystem less mature than major commercial models</p>\n<h3 id=\"best-use-cases-8\">Best use cases</h3>\n<p>Research, transparent reasoning pipelines and minimally censored open-source applied use.</p>\n<h2 id=\"important-considerations-benchmarks-and-evaluation\">Important considerations: benchmarks and evaluation</h2>\n<p>Benchmark results are volatile and can depend on model variant, tuning, context length and test configuration. Many results are vendor-reported and lack broad third-party validation.</p>\n<p>Long-context performance depends on endpoint and hardware. Reasoning modes can produce substantial performance swings. Open-weight models benefit from community scrutiny, whereas commercial models often publish fewer benchmarking details.</p>\n<h2 id=\"conclusion-selecting-the-right-model\">Conclusion: selecting the right model</h2>\n<p>The 2025 AI landscape provides exceptional choice.</p>\n<p><strong>General-purpose chat</strong>: GPT-4o, GPT-4.5<br>\n<strong>Enterprise automation</strong>: GPT-5, Claude Sonnet 4.5, Grok 3<br>\n<strong>Deep reasoning</strong>: GPT-5, Claude Sonnet 4.5, Grok 3, Gemini 2.5 Pro<br>\n<strong>Coding excellence</strong>: Claude Sonnet 4.5, Kimi K2, GPT-5<br>\n<strong>Cross-modal work</strong>: Gemini 2.5 Pro<br>\n<strong>Ultra-long context</strong>: Gemini 2.5 Pro, Llama 4 Scout<br>\n<strong>Cost optimisation</strong>: Llama 4 Maverick, Mistral Medium 3, Kimi K2<br>\n<strong>Open-source and on-prem</strong>: DeepSeek R1, Qwen 3, Hermes 4, Llama 4, Kimi K2<br>\n<strong>Agentic workflows</strong>: Kimi K2, Claude Sonnet 4.5, Hermes 4<br>\n<strong>Multilingual</strong>: Qwen 3, Mistral Large<br>\n<strong>Transparent reasoning</strong>: Hermes 4, DeepSeek R1, Qwen 3</p>\n<p>Selecting the right model depends on budget, deployment strategy, task complexity, transparency needs, regulatory requirements and integration demands. Ongoing evaluation remains critical as the market evolves rapidly.</p>\n<h2 id=\"ethics-and-disclaimer\">Ethics and disclaimer</h2>\n<p>This analysis is for informational purposes only and reflects research available as of November 2025. No compensation influenced provider positioning. Capabilities, pricing and performance can change quickly. Readers should verify current information, especially for enterprise deployment, compliance, privacy and intellectual-property considerations.</p>\n<p><em>Last updated November 2025</em></p>\n<p>Keywords : #ArtificialIntelligence #AIModels #GPT5 #ClaudeSonnet45 #Gemini25Pro #Grok4 #DeepSeekR1 #KimiK2 #Qwen3 #Llama4 #MistralAI #Hermes4 #AgenticAI #AICoding #AIDevelopment #EnterpriseAI #GenerativeAI #MachineLearning #ML #MultimodalAI #OpenSourceAI #AIResearch #AITools #STEMAI #LongContextAI #AIComparison #TechInnovation #FutureOfAI #AIProductivity #AIEngineering #AITrends #AIin2025 #NeuralNetworks #AIAnalytics #BusinessAI #AIEcosystem</p>\n<img src=\"https://cdn.uploads.micro.blog/255457/2025/chatgpt-image-nov-5-2025-at-02-27-31-pm.png\">",
        "date_published": "2025-11-05T16:27:00-04:00",
        "url": "https://kiledjian.com/2025/11/05/comprehensive-analysis-of-leading-ai.html",
        "tags": ["Artificial Intelligence"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2025/11/04/amazon-and-perplexity-have-kicked.html",
        
        "content_html": "<p><a href=\"https://www.theverge.com/news/813755/amazon-perplexity-ai-shopping-agent-block\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon and Perplexity have kicked off the great AI web browser fight | The Verge</a></p>\n<p>Amazon has requested that Perplexity stop its AI browser, Comet, from purchasing products on its site, accusing the AI startup of providing a degraded shopping experience. Perplexity, in turn, has accused Amazon of bullying and stated that the e-commerce giant is more interested in serving ads and sponsored results than facilitating easier shopping, despite Amazon&rsquo;s CEO expecting future partnerships with AI shopping agents.</p>\n",
        "date_published": "2025-11-04T22:54:00-04:00",
        "url": "https://kiledjian.com/2025/11/04/amazon-and-perplexity-have-kicked.html",
        "tags": ["Artificial Intelligence","Technology \u0026 Business"]
      },
      {
        "id": "http://ekiledjian2.micro.blog/2025/11/02/geoffrey-hinton-says-tech-giants.html",
        
        "content_html": "<p><a href=\"https://fortune.com/2025/11/01/geoffrey-hinton-godfather-of-ai-investment-tech-company-profits-human-labor-replacement/\" target=\"_blank\" rel=\"noopener noreferrer\">Geoffrey Hinton says tech giants can&rsquo;t profit from AI investments unless human labor is replaced | Fortune</a></p>\n<p>According to Geoffrey Hinton, tech giants cannot profit from their AI investments without replacing human labor. He believes that the massive capital expenditures by companies like Microsoft, Meta, and Alphabet are predicated on the idea of widespread job displacement by AI, though he acknowledges AI&rsquo;s potential for good in fields like healthcare and education.</p>\n",
        "date_published": "2025-11-02T17:01:00-04:00",
        "url": "https://kiledjian.com/2025/11/02/geoffrey-hinton-says-tech-giants.html",
        "tags": ["Artificial Intelligence","Leadership \u0026 Mindset"]
      }
  ]
}
