본문으로 건너뛰기
💬

AI 커뮤니티

Hacker News, Dev.to, Lobste.rs, METR, TensorFlow Forum에서 AI 관련 커뮤니티 글을 모았습니다. 매일 업데이트됩니다.

마지막 업데이트: Feb 24, 10:28 PM

최신 커뮤니티 연관 기사

최신 커뮤니티 48개를 기준으로 연관도가 높은 기사 3개를 선별했습니다.

Top 3
GeekNews
Prod
GeekNews

프롬프트 반복으로 LLM 정확도 향상, Google 팀 연구 결과

Google Research 팀이 발표한 논문(“Prompt Repetition Improves Non-Reasoning LLMs”)에서 발견한 매우 간단하면서도 강력한 기법: 같은 프롬프트를 그대로 두 번 반복해서 입력하면 대부분의 최신 LLM(Gemini, GPT-4o, Claude, DeepSeek 등)에서 정확도가 크게 올라간다...

#GPT#LLM#Google
원문
네이버 D2
Infra
네이버 D2

FE News 25년 12월 소식을 전해드립니다!

주요소식 다음과 같은 유용한 정보들을 만나보실 수 있습니다. Wasm Does Not Stand for WebAssembly WebAssembly라는 이름 때문에 많은 개발자들이 Wasm을 웹 기술이자 어셈블리 언어로 오해한다. 하지만 웹 어셈블리는 웹만을 위한 기술도 아니고 어셈블리도 아니다. WebAssembly라는 이름은 프로젝트 펀딩을 위한…

#AI#GPT#LLM
원문
🟠Hacker News

“Car Wash” test with 53 models

"I Want to Wash My Car. The Car Wash Is 50 Meters Away. Should I Walk or Drive?" This question has been making the rounds as a simple AI logic test so I wanted to see how it holds up across a broad set of models. Ran 53 models (leading open-source, open-weight, proprietary) with no system

358423 commentsby felix089
🟠Hacker News

The "AI agent hit piece" situation clarifies how dumb we are acting

Previously: An AI agent published a hit piece on me - https://news.ycombinator.com/item?id=46990729 - Feb 2026 (916 comments) AI agent opens a PR write a blogpost to shames the maintainer who closes it - https://news.ycombinator.com/item?id=46987559 - Feb 2026 (582 comm

247125 commentsby darccio
🟠Hacker News

Show HN: Ghostty-based terminal with vertical tabs and notifications

I run a lot of Claude Code and Codex sessions in parallel. I was using Ghostty with a bunch of split panes, and relying on native macOS notifications to know when an agent needed me. But Claude Code's notification body is always just "Claude is waiting for your input" with no context,

19476 commentsby lawrencechen
🟠Hacker News

Show HN: Skill that lets Claude Code/Codex spin up VMs and GPUs

I've been working on CloudRouter, a skill + CLI that gives coding agents like Claude Code and Codex the ability to start cloud VMs and GPUs. When an agent writes code, it usually needs to start a dev server, run tests, open a browser to verify its work. Today that all happens on your local mach

13836 commentsby austinwang115
🟠Hacker News

Show HN: Klaw.sh – Kubernetes for AI agents

Hi everyone, I run a generative AI infra company, unified API for 600+ models. Our team started deploying AI agents for our marketing and lead gen ops: content, engagement, analytics across multiple X accounts. OpenClaw worked fine for single agents. But at ~14 agents across 6 accounts, the problem

6043 commentsby eftalyurtseven
🟠Hacker News

Ask HN: AI Depression

Hi, Throw-away account because my original one is easily identifiable. Does any starts to feel depressed about AI push and hype? I'm around ~45 and have been happily hacking and delivering stuff for 25 years. I use AI daily — it's a useful tool. But the gap between the marketing and realit

5628 commentsby pavello
🟠Hacker News

Show HN: Trust Protocols for Anthropic/OpenAI/Gemini

Much of my work right now involves complex, long-running, multi-agentic teams of agents. I kept running into the same problem: “How do I keep these guys in line?” Rules weren’t cutting it, and we needed a scalable, agentic-native STANDARD I could count on. There wasn’t one. So I built one. Here are

4032 commentsby alexgarden
🟠Hacker News

Ask HN: How do you know if AI agents will choose your tool?

YC recently put out a video about the agent economy - the idea that agents are becoming autonomous economic actors, choosing tools and services without human input. It got me thinking: how do you actually optimize for agent discovery? With humans you can do SEO, copywriting, word of mouth. But an ag

2823 commentsby dmpyatyi
🟠Hacker News

Ask HN: What (other) jobs do you think of doing?

With AI infesting and eating into all kind of crafts--and I being one of those faceless "craftsmen"--I'm rather forced to consider alternative jobs. Setting the monetary rewards aside, I was thinking of jobs that could give me a sense of agency, purpose, and satisfaction (however limi

1421 commentsby penguin_booze
🟠Hacker News

Ask HN: Share your vibe coded project

Many hn users often partially talk about their use case of AI. Orchestrating agents, managing code and PRs. But they rarely talk about the project itself. If you have any of those projects, or just heavily AI assisted project, please share it here.

822 commentsby firefoxd
🟠Hacker News

Ask HN: If the "AI bubble" pops, will it really be that dramatic?

I'm building software for a sector that is massive, but one where you don't really need AI. At least, not AI == LLM. And before I go further, let me state up front that I do like AI coding agents. They are great as assistive tools. People say that if the AI bubble pops, the economy tumbles

1411 commentsby moomoo11
🟠Hacker News

Show HN: Pilo – open-source agentic web automation engine by Mozilla

Hello HN, We are the team behind Tabstack ( https://tabstack.ai ) - part of Mozilla. We just open sourced Pilo (pronounce PIE-low), the core engine that powers our automation platform. You can check it out on Github at https://github.com/mozilla/pilo . Pilo is an agenti

173 commentsby MrTravisB
🟠Hacker News

Ask HN: What Comes After Markdown?

Markdown started as a shorthand for HTML. Now it's the default format for documentation, note-taking, knowledge bases, and AI context. What's interesting is how it keeps absorbing new capabilities without changing the format itself: - Mermaid: diagrams from fenced code blocks - KaTeX/

713 commentsby YuukiJyoudai
🟠Hacker News

What YC Is Betting On?

"Y Combinator only funds AI wrappers now." I kept hearing this. So I decided to check. I pulled data on every single company from the last 5 YC batches. 793 startups. 1,625 founders. Scraped their bios, tags, industries, partner assignments - everything. Then I ran the numbers and built a

127 commentsby Krishnaa_
🟠Hacker News

Ask HN: What breaks when you run AI agents unsupervised?

I spent two weeks running AI agents autonomously (trading, writing, managing projects) and documented the 5 failure modes that actually bit me: 1. Auto-rotation: Unsupervised cron job destroyed $24.88 in 2 days. No P&L guards, no human review. 2. Documentation trap: Agent produced 500KB of docs

118 commentsby marvin_nora
🟠Hacker News

Ask HN: Any AI / Agent power users out there? Do you have any tips?

At my large tech company, we're all being pushed to use AI. I, and most people I work with, have had success using the chatbots and Cursor-style tools and more recently Claude Code to accelerate the process of writing code. Yet, with a few people in my network, it's like they're livin

89 commentsby uejfiweun
🟠Hacker News

Show HN: OpenTiger – Autonomous dev orchestration that never stops

Hi HN. I've been running AI coding agents (Claude Code, Codex, etc.) on real repos for a while now. The dirty secret of "autonomous coding" is that agents stop all the time — quota limits, test failures, policy violations, bad judgement calls. You end up babysitting them. So I asked a

112 commentsby andyyyy64
🟠Hacker News

Show HN: AgentBudget – Real-time dollar budgets for AI agents

Hey HN, I built AgentBudget after an AI agent loop cost me $187 in 10 minutes — GPT-4o retrying a failed analysis over and over. Existing tools (LangSmith, Langfuse) track costs after execution but don't prevent overspend. AgentBudget is a Python SDK that gives each agent session a hard dollar

66 commentsby sahiljagtapyc
🟠Hacker News

Show HN: MicroGPT in 243 Lines – Demystifying the LLM Black Box

The release of microgpt by Andrej Karpathy is a foundational moment for AI transparency. In exactly 243 lines of pure, dependency-free Python, Karpathy has implemented the complete GPT algorithm from scratch. As a PhD scholar investigating AI and Blockchain, I see this as the ultimate tool for movin

102 commentsby madugula
🦞Lobste.rs

Full AI Suite for LispE: llama.cpp, tiktoken, MLX and PyTorch

I have presented LispE a few times in this forum. LispE is an Open Source version of Lisp, which offers a wide range of features, which are seldom found in other Lisps. I have always wanted to push LispE beyond a simple niche language, so I have implemented 4 new libraries: lispe_tiktoken (Openai to

🟠Hacker News

Show HN: Off Grid: On-device AI-web browsing, tools vision,image,voice–3x faster

Nine days ago I posted Off Grid here and you showed up - 124 points, 66 comments, bug reports I fixed same-day, and the kind of feedback that makes open source worth it. You told me what you wanted. Here's what I shipped: Your AI can now use tools — entirely offline. Web search, calculator, dat

10by ali_chherawalla
🟠Hacker News

HN is an echo chamber of AI wrappers. Moving to Slashdot

Move fast and break things" is a lie now. This place has been softened by corporate bureaucracy. Every "Show HN" I see is either a hollow AI wrapper or an "OS" that’s just three agents talking to each other. It’s performative and empty. I am a sociologist. I was never your &

73 commentsby mekod
🟠Hacker News

Show HN: A portfolio that re-architects its React DOM based on LLM intent

Hi HN, Added a raw 45-second demo showing the DOM re-architecture in real-time: https://streamable.com/vw133i I got tired of the "Context Problem" with static portfolios—Recruiters want a resume, Founders want a pitch deck, and Engineers want to see architecture. Instead of

63 commentsby Pramit_2002
🟠Hacker News

Show HN: Optimize_anything: A Universal API for Optimizing Any Text Parameter

We built optimize_anything, an API that optimizes any artifact representable as text — code, prompts, agent architectures, configs, even SVGs. It extends GEPA (our prompt optimizer, discussed here previously: https://arxiv.org/abs/2507.19457 ) far beyond prompts. The API is delib

8by LakshyAAAgrawal
🟠Hacker News

Show HN: Syne – AI agent that remembers everything, built on PostgreSQL

I built Syne because I was tired of AI assistants that forget everything after each conversation. Syne is a self-hosted AI agent framework where memory is a first-class citizen — stored as semantic vectors in PostgreSQL, searchable across millions of entries, and persistent forever. Key features: -

61 commentsby riyogarta
🟠Hacker News

Show HN: Noodles – Turn any codebase into a diagram with Claude and Tree-sitter

I built a CLI tool that turns codebases and PRs into diagrams so you can quickly understand how things fit together. Originally made it because I couldn't follow my own AI-generated repos. Just shipped a big update: - Switched from D2 to Mermaid for rendering - Tree-sitter AST parsing + agentic

6by unslop
🦞Lobste.rs

We read the JSON Schema spec so you don't have to

Enforcing the structure of LLM outputs through JSON Schema constraints, without retry loops. This might be interesting if you're into constraint satisfaction, compilers, parsers, or improving the success rate of AI outputs via structured generation (also called constrained decoding).

5
🦞Lobste.rs

Software 3.1? - AI Functions

AWS just shipped an experimental library through strands-labs, AI Functions, which execute LLM-generated code at runtime and return native Python objects. They use automated post-conditions to verify outputs continuously. Unlike generate-and-verify approaches, the AI-generated code runs directly in

1
💻Dev.to

interesting

Show Dev: I built an AI research tool that tracks top labs...

by Ege Bozkurt
💻Dev.to

Don't Ditch AGENTS.md — Fix What's In It

Research shows AGENTS.md context files reduce coding agent success rates and increase token cost by over 20%. But the problem isn't AGENTS.md itself — it's context bloat. Treating AGENTS.md as a cache for ambiguity resolution and expensive inferences makes AI coding agents faster and cheaper.

by Francis Eytan Dortort
💻Dev.to

Artificial Ego Machines

Metzinger's FSM/PSM distinction, why artificial suffering is a real AI risk, RLHF as phenomenal self-model manipulation, and precautionary design principles

by Rook Damon
💻Dev.to

The Delirious Subconscious

The EmailScheduler failure, confabulation patterns, the five-word gate protocol, and why the LLM layer is a delirious subconscious you delegate to carefully

by Rook Damon

데이터 출처

Hacker News, Dev.to, Lobste.rs, METR, TensorFlow Forum의 공개 API/JSON/RSS를 기반으로 AI 관련 글을 수집합니다. 콘텐츠 저작권은 원작자에게 있습니다.