Binance Interview PrepApplied AI Agent Developer · Accelerator Program 2026
0 / 50
🙋

HR & 行为面试

第一轮和管理层轮均会问。用 STAR 框架(Situation / Task / Action / Result)来结构化回答。

已查看 0 / 10
↓ 按可能性从高到低排序 — 越靠上越重要
准备 2–3 个好问题(体现主动性,非常重要):
  • "What does a typical week look like for an AI Agent intern on this team?"
  • "Are there existing LangGraph or LLM-based systems in production at Binance that I'd be contributing to, or is the role more greenfield?"
  • "What are the biggest technical challenges the team faces with AI agent reliability at Binance's scale?"
  • "How does Binance think about the trade-off between moving fast with AI features and ensuring safety for financial users?"
💡 Tip: 永远不要说"No, I think you've covered everything." 这是你最后展示思考深度的机会。
建议结构(60–90 秒):
  • 背景:I'm a Computer Science student with a strong focus on AI systems and backend engineering.
  • 核心项目:Most recently I built an ASX multi-agent chatbot using LangGraph — it orchestrates several AI agents to retrieve and analyse Australian stock data in real time, with a streaming frontend that visualises the agent execution graph as it runs.
  • 技术栈:The stack includes Python, LangGraph, RAG-based retrieval, a TypeScript frontend, and a streaming layer.
  • 为什么 Binance:I'm now looking to apply these skills at the intersection of AI and real financial infrastructure — which is exactly what the Binance Accelerator Program offers.
💡 Tip: 不要背稿,要像和朋友说话一样自然。结尾直接过渡到"为什么 Binance"。
三个角度展开:
  • 规模与影响力:Binance serves 300 million users across 100+ countries. Any AI system I help build has immediate, massive real-world impact — that's rare at this career stage.
  • AI + Finance 交叉点:I've been building AI agents for financial data analysis — Binance is one of the few places where agentic AI is directly applied to live trading infrastructure and user products.
  • Web3 的信念:I believe decentralised finance genuinely expands financial access for people excluded from traditional banking. Working at Binance means contributing to that mission.
💡 Tip: 避免说"because it looks great on a resume"。面试官想听到你对 crypto/Web3 的真实理解。
用 ASX 项目的真实经历:
  • Situation: I needed a multi-agent orchestration layer but LangChain's standard chains couldn't handle conditional branching and state persistence.
  • Task: Migrate to LangGraph within a week while keeping the existing RAG retrieval pipeline intact.
  • Action: I read the LangGraph docs, built a small isolated prototype first to validate my understanding, then incrementally ported each agent node.
  • Result: Migrated the entire system in 5 days. The graph structure also made it much easier to add the streaming progress panel.
💡 Tip: 强调"先做小原型验证"这个思维方式,Binance 很看重能快速迭代的工程师。
关键点:沟通、责任划分、处理分歧:
  • 描述一个真实的团队经历(课程项目、黑客松、或工作经历)
  • 说清楚你具体负责什么,不要只说"we did X"
  • 如果有技术分歧,描述你如何用数据或 prototype 说服他人
  • Result 要量化:完成了什么、学到了什么
💡 Tip: 如果缺乏团队经验,可以谈和导师合作,或参与开源 community 的经历。
  • "Short-term: I want to go deep on production AI systems — understanding how to take an agent from a prototype to something that serves millions of users reliably."
  • "Medium-term: I'd love to contribute to Binance's AI infrastructure as a full-time engineer after graduating."
  • "Long-term: I'm interested in how AI can make financial markets more transparent and accessible for retail investors globally."
💡 Tip: 把你的目标和 Binance 的使命挂钩(financial freedom / global access)。
JD 明确说"ability to tackle ambiguous problems",说明 Binance 特别看重这个:
  • "When I started the ASX chatbot, the requirement was just 'build something useful for ASX investors.' There was no spec."
  • I broke it into: (1) define 'useful' by listing 5 user stories, (2) validate the riskiest assumption first — whether LLM + ASX data actually produces reliable answers, (3) time-box explorations to 2-day spikes before committing to an architecture.
  • Result: shipped a working prototype in 2 weeks because I avoided over-engineering before the core premise was validated.
💡 Tip: 面试官想看到的是:不慌张、系统拆解、快速实验、敢于决策。
诚实但有成长弧度:
  • 选一个真实的弱点,但不是核心岗位技能。
  • 好例子:"I'm still building fluency with distributed systems and production deployment at scale — I've mostly worked with single-node setups locally."
  • 然后接:"I'm addressing this by studying Binance's engineering blog and experimenting with Docker and Kubernetes."
💡 Tip: 不要选假弱点("我太努力了")。选一个边缘技能,展示自我意识和主动学习。
要具体,不要泛泛说"I read papers":
  • 具体来源:Hugging Face Daily Papers / Simon Willison's blog / LangGraph changelogs
  • 实践驱动:"I try to implement something small from every major update I read — the streaming overhaul in my ASX project came from testing a new LangGraph release."
  • 社群参与:GitHub discussions, Discord, Twitter/X AI community
💡 Tip: 提一个你最近读到的具体更新,来证明你真的在跟进。
  • 如果你会中文:"Yes, Mandarin is my first language — I'm very comfortable communicating in both English and Chinese, which I think is an asset in a global team."
  • 如果英文为主:"My primary language is English. I understand Binance has a strong Chinese-speaking team, and I'm open to picking up more Mandarin as I go."
💡 Tip: 中文能力是加分项,不是门槛。诚实表达即可。
💼

项目深度问答

面试官会深挖你的 ASX Multi-Agent LangGraph Chatbot。他们想看你真正懂自己做的东西,而不只是会介绍功能。

已查看 0 / 10
↓ 按可能性从高到低排序 — 越靠上越重要
用 5 句话讲清楚:Problem → Architecture → Key decision → Frontend → Result
  • Problem: Retail investors want to ask natural language questions about ASX stocks ("Is BHP overvalued?") but data is scattered across financial APIs, news feeds, and filings.
  • Architecture: A LangGraph-based orchestrator routes user queries to specialised agent nodes — a retrieval agent (RAG over financial documents), a data-fetch agent (live stock prices), and a synthesis agent (LLM reasoning).
  • Key decision: I chose LangGraph over simple chains because I needed stateful, conditional routing — some queries only need retrieval, others need live data + retrieval combined.
  • Frontend: A TypeScript streaming UI shows the agent graph executing in real time — each node lights up as it runs.
  • Result: The system can answer complex multi-source questions with RAG ensuring answers are grounded in actual documents.
💡 Tip: 画一个简单的图:User → Router → [Retrieval Node, Data Node] → Synthesis → Response。能画出来比只描述强很多。
  • State persistence: A simple Python loop has no built-in state — if a node fails mid-way, you lose everything. LangGraph's checkpointer persists state after each node, enabling retry without rerunning the whole pipeline.
  • Conditional routing: I needed branching logic — "if the query requires live data, route to the data agent; otherwise skip it." LangGraph's conditional edges express this cleanly.
  • Streaming: LangGraph emits events per node completion, which I forward to the frontend. This enables the real-time graph visualisation.
  • Query reuse: The explicit graph structure made adding a cross-cutting cache optimisation straightforward.
💡 Tip: 一定要说"我考虑过 X 方案,但选了 LangGraph 因为…"。说出你拒绝的选项,显示你做过真正的架构权衡。
过程比结果更重要。用这个结构:
  • Symptom → What I initially thought → How I actually diagnosed it → Root cause → Fix
  • 例子方向:GraphPanel 去重 bug(事件重复导致 UI 闪烁)、TypeScript 类型窄化引发的 dead code、status 守卫判断问题——根据你项目真实的 git history 选一个。
  • 强调调试工具:structured logging, TypeScript type narrowing, event tracing.
💡 Tip: "I added logging at every node boundary, which revealed the event was emitted twice due to LangGraph's retry mechanism." 这类细节让面试官相信你真正 hands-on。
  • Semantic caching: Replace exact-match cache with embedding-based similarity matching to handle paraphrased queries.
  • Automated evaluation: Build an LLM-as-judge pipeline that scores responses against a golden dataset nightly.
  • Re-ranking: Add a cross-encoder re-ranker between retrieval and synthesis to improve chunk precision.
  • Observability: Integrate LangSmith or OpenTelemetry to visualise end-to-end latency breakdown per node.
  • Auth & multi-user: The current system is single-user — proper auth and per-user conversation history would make it production-ready.
  • Backend: LangGraph's astream_events() yields events for each node start/end/error. I pipe these as Server-Sent Events to the client.
  • Frontend: A TypeScript component maintains a graphState object tracking each node's status (pending | running | completed | error).
  • Rendering: Each status maps to a visual style — the node card pulses while running, turns green on completion, shows an error badge on failure.
  • Deduplication: LangGraph can emit multiple events for the same node in retries — I implemented deduplication to prevent UI flickering.
💡 Tip: 如果被问"为什么用 SSE 不用 WebSocket":SSE 是单向流,足够满足状态推送需求,且更简单可靠;WebSocket 的双向能力在这里是过度工程。
  • Problem: When a user asks follow-up questions within the same session, sub-queries to the retrieval agent were often identical — fetching the same documents repeatedly at cost and latency.
  • Solution: A session-scoped query cache keyed by a hash of the query + filter parameters. Before executing a retrieval node, the graph checks the cache; on a hit, it bypasses the vector DB call entirely.
  • Impact: Retrieval latency dropped ~60% on cache hits. Also reduced embedding API costs proportionally.
💡 Tip: 这是 semantic caching 的简化版。可以说:"In production at Binance's scale, you'd want full semantic caching — matching similar queries by embedding similarity, which handles ~30% of enterprise query traffic."
  • Checkpointing: LangGraph's checkpointer saves state after each node. On failure, I catch the exception, mark the node as errored in state, and return a graceful error message rather than crashing the whole graph.
  • Retry logic: For transient failures (rate limits, timeouts), exponential backoff with a max of 3 retries before marking the node as failed.
  • Graceful degradation: If the live data agent fails, the graph falls back to retrieval-only mode rather than returning nothing.
  • Problem: After retrieval, the graph state only contained raw chunks. Downstream agents had no visibility into why those chunks were retrieved — which source, what relevance score, whether from a news article or a financial filing.
  • Solution: I enriched each retrieved chunk with metadata: source document type, recency, relevance score, and retrieval method (dense embedding vs. keyword fallback).
  • Benefit 1: The synthesis agent can now down-weight chunks from older filings when news is more recent.
  • Benefit 2: The frontend can display "Sources" with confidence indicators — transparency into what the answer is based on.
💡 Tip: 用"provenance"这个词描述 chunk 的来源追踪——在金融 AI 里,能回答"为什么你这么说"几乎和答案本身一样重要。
  • Retrieval quality: Built a small test set of questions with known correct answers. Measured recall@5 (were the right documents in the top 5 retrieved?).
  • Answer quality: Manually rated a sample of responses on factual accuracy and relevance.
  • Latency: Timed end-to-end response time. Retrieval-only: ~1.5s, full multi-agent: ~4–5s.
  • Cost tracking: Logged token usage per query to estimate cost per conversation.
  • Gap I acknowledge: I didn't implement automated LLM-as-judge evaluation — that would be the natural next step.
  • Backend owns all intelligence: LangGraph orchestration, retrieval, LLM calls, caching — nothing intelligent happens in the browser.
  • Frontend owns display state only: The TypeScript frontend tracks node statuses for UI rendering but never makes business logic decisions.
  • Protocol: Backend streams structured events (node_started, node_completed, final_response) — the frontend maps these to visual states.
  • Why this split: Keeps the frontend thin and testable. A CLI or API client can talk to the same backend with no UI coupling.
🤖

LLM & RAG 基础

AI Agent 岗位的核心技术面试内容。面试官想确认你真正理解 LLM 的工作原理,而不只是会调 API。

已查看 0 / 10
↓ 按可能性从高到低排序 — 越靠上越重要
  • Prompt engineering: Modify the input to steer model behaviour. Zero cost, zero latency overhead. Use when the model already has the knowledge and just needs better instruction. Limit: can't inject dynamic or private data reliably.
  • RAG: Retrieve relevant documents at query time and inject into context. Use when answers depend on private, current, or domain-specific data (e.g., ASX filings, Binance market data). No retraining needed. Trade-off: adds retrieval latency.
  • Fine-tuning: Update model weights on domain-specific data. Use when you need consistent style, format, or reasoning patterns that can't be achieved with prompts. Trade-off: expensive, slow iteration cycle.
💡 Decision rule: Try prompting first → RAG if knowledge is dynamic/private → fine-tuning only if the above two can't meet quality targets.
Offline (indexing):
  • Load and clean documents → chunk into segments → embed each chunk (text-embedding-3-small) → store vectors + metadata in a vector DB
Online (query time):
  • Embed the user query → run ANN search in vector DB → optionally re-rank results → inject top-k chunks into LLM prompt → LLM generates grounded response
💡 Key insight: "The most common failure point is chunking strategy. Bad chunks degrade both retrieval precision and generation quality."
定义:LLMs generate plausible-sounding but factually incorrect content because they optimise for fluency, not truth.

Mitigations:
  • RAG grounding: Force the model to answer only from retrieved context, not from parametric memory.
  • Citation enforcement: Prompt the model to cite the specific chunk it's drawing from. If it can't cite it, it shouldn't say it.
  • Output validation: Use a second LLM call to verify the response is consistent with retrieved documents.
  • Uncertainty expression: Ask the model to say "Based on available data, I cannot confirm…" rather than confabulating.
💡 Binance angle: In financial AI, a confabulated stock price could cause real harm. Mention you'd implement strict citation + output validation for any financial use case.
  • Traditional DB: Stores structured data, supports exact-match and range queries. Can't find "semantically similar" records.
  • Vector DB: Stores high-dimensional float vectors (embeddings). Supports approximate nearest-neighbour (ANN) search — finding vectors semantically close in embedding space.
  • How ANN works: Algorithms like HNSW (Hierarchical Navigable Small World) allow sub-linear time lookup across millions of vectors.
  • Hybrid search: Production RAG systems combine vector search (semantic recall) + BM25 keyword search (keyword precision).
  • What: Prompting the model to reason step-by-step before answering — "Let's think through this step by step…"
  • When it helps: Multi-step reasoning (maths, logic, planning), complex instruction-following.
  • When it doesn't help: Simple retrieval tasks (adds latency/tokens with no benefit), tasks where speed matters more than accuracy.
  • Cost: CoT increases token usage significantly — balance accuracy gain against latency and cost in production.
  • Hallucination: Model generates false facts — especially dangerous in financial contexts.
  • Tool call failures: LLM selects wrong tool, passes malformed arguments, or misinterprets tool output.
  • Infinite loops: Agent keeps calling tools without making progress — needs loop detection and max-step limits.
  • Context window overflow: Accumulated tool outputs exceed context limit — need summarisation or pruning.
  • Prompt injection: Adversarial content in retrieved documents manipulates agent behaviour.
Two levels: offline evaluation + online monitoring:
  • Offline: Golden test set (curated Q&A pairs), retrieval metrics (Recall@k, Precision@k), LLM-as-judge scoring responses on rubrics (accuracy, relevance, conciseness).
  • Online: Latency (p50/p95/p99), tool call success rate, user feedback signals (thumbs up/down), cost per conversation, error rates by node type.
  • Small chunks (100–300 tokens): Higher retrieval precision. Risk: fragments context, losing surrounding information.
  • Large chunks (500–1500 tokens): Preserves more context. Risk: lower precision — dilutes relevance scores with irrelevant text.
  • Overlap: Add 10–20% token overlap between consecutive chunks to avoid cutting information across boundaries.
  • Practical approach: Start with 512 tokens + 10% overlap, measure retrieval recall on a test set, then tune based on document type.
  • What: Instead of caching only exact-match queries, embed the query and cache responses keyed by embedding similarity. If a new query is semantically close to a cached one (cosine similarity > threshold), return the cached response.
  • Why valuable: Research shows ~30% of enterprise LLM queries are semantically equivalent paraphrases of previous queries.
  • Risk: False positives — semantically similar queries that actually need different answers. Must tune the similarity threshold carefully.
Cost:
  • Semantic caching — serve cached responses for semantically equivalent queries
  • Model routing — use cheap models (Haiku, Llama) for simple tasks, frontier models only for complex reasoning
  • Prompt compression — remove redundant context, shorten system prompts
Latency:
  • Streaming — stream tokens so users perceive faster response
  • Parallel tool calls — execute independent tool calls concurrently with asyncio.gather()
  • Pre-filter context — only inject the most relevant chunks to keep prompt short
🕸️

LangGraph & Agent 系统设计

最直接相关的技术问题。大部分偏深,internship 面试可能只浅问前几道,但如果能答出来会非常加分。

已查看 0 / 10
↓ 按可能性从高到低排序 — 越靠上越重要
  • LangChain chains: Linear sequence — A → B → C. Good for simple pipelines. Breaks down when you need loops, conditional branching, or multi-agent state.
  • LangGraph solves:
    • State management: Typed state schema flows through the graph; each node reads and writes state explicitly.
    • Conditional routing: Edge conditions decide which node runs next based on current state.
    • Persistence: Checkpointer saves state after each node — enables pause, resume, retry.
    • Streaming: Native event streaming per node — enables real-time UI updates like your GraphPanel.
ReAct loop:
  1. Thought: LLM reasons about the current state and what action to take
  2. Act: LLM selects a tool and generates its arguments
  3. Observe: Tool executes, result appended to context
  4. Repeat until LLM decides to return a final answer
  • LangGraph: ReAct is the default in create_react_agent() — a loop node alternating between LLM call and tool execution, with an edge condition that exits when the LLM stops calling tools.
  • Limit: Linear — one thought-act cycle at a time. Tree-of-Thought explores multiple paths simultaneously for complex problems.
  • Checkpointer (within-thread): Saves the full graph state after each node execution within a single conversation thread. Enables resuming a conversation and human-in-the-loop interruptions. Scoped to one session.
  • Store (cross-thread): A persistent key-value store shared across multiple conversation threads. Stores long-lived information like user preferences or shared knowledge bases that survive across sessions.
  • Analogy: Checkpointer = conversation history within one chat window. Store = a shared notebook all conversations can read and write.
💡 In your project: Your query cache is conceptually similar to Store — it persists retrieved results across the session for reuse.
  • Mechanism: LangGraph supports interrupt_before / interrupt_after on any node. The graph pauses, serialises its state via checkpointer, and waits for human input.
  • Flow: Agent plans action → Graph pauses → Human reviews in UI → Human approves/rejects → Graph resumes from checkpoint.
  • Use cases at Binance: Before placing a trade recommendation, before sending user notifications, before deleting data — any high-impact irreversible action.
💡 Safety insight: HITL at Binance is partly a regulatory compliance tool — financial AI needs audit trails of human approvals for high-value actions.
  • Classify errors first: Retryable (rate limit, timeout, transient network) vs. non-retryable (invalid arguments, permission denied).
  • Retry strategy: Exponential backoff with jitter for retryable errors. Max 3 attempts before failing the node.
  • Graceful degradation: If a tool fails after retries, fall back to alternative tools or inform the user — never silently return wrong data.
  • LangGraph: Wrap tool calls in try/except, update state with error info, use conditional edges to route to fallback nodes.
  • What: A TypedDict or Pydantic model defining the shape of data flowing through the graph — e.g., messages: list, retrieved_docs: list, query: str, error: str | None.
  • Why important: Type safety (catches mismatches at construction time), clarity (every node knows what it can read/write), streaming (subscribe to specific fields), checkpointing (determines what gets serialised and restored).
  • In multi-agent systems: All agents must agree on the shared state schema — it's the contract between them.
  • Supervisor pattern: A central orchestrator LLM decides which specialist agent to call next, collects results, synthesises the final output. Simple to reason about, single point of failure.
  • Peer-to-peer (handoff): Agents directly pass control to each other. More parallel, but harder to debug — needs clear handoff contracts.
  • In LangGraph: Use sub-graphs for each specialist agent. The supervisor graph has nodes that invoke the sub-graphs as units.
  • Key concerns: Shared state schema (contract between agents), cost tracking per agent, conflict resolution, tracing which agent made which decision.
Structure: Requirements → Architecture → Safety → Scale
  • Requirements: Natural language queries on crypto markets, news, on-chain data. <5s latency, <$0.01/query cost.
  • Architecture: Ingestion pipeline (news feeds, Binance API, on-chain data) → vector DB → LangGraph orchestrator → specialist agent nodes (Retrieval, Live Data, Analysis, Synthesis).
  • Caching layer: Semantic cache for repeated queries, TTL cache for live data.
  • Safety: HITL for trade-related recommendations, output validation for financial claims, audit logging.
  • Scale: Queue-based invocation, read replicas for vector DB, streaming to reduce perceived latency.
  • Traces: End-to-end trace per request — every LLM call, tool call, retrieval step with latency breakdown. Tools: LangSmith, OpenTelemetry + Jaeger.
  • Metrics: p50/p95/p99 latency per node, token usage, cost per conversation, tool success rate, cache hit rate.
  • Logs: Input query (anonymised), node outputs, errors with stack traces, LLM model used.
  • LangGraph built-in: astream_events() emits structured events per node — these are the raw material for your observability pipeline.
  • Cache first: Semantic cache check takes ~20ms. On a hit, total response time <50ms.
  • Use smaller models: For simple routing nodes, use fast models rather than frontier models.
  • Parallel retrieval: Run vector search and keyword search concurrently with asyncio.gather().
  • Pre-filter context: Reduce chunk count passed to the LLM — fewer tokens = faster generation.
  • Streaming: Start streaming output as soon as the first token is ready — perceived latency drops significantly.
  • Profile first: Use traces to identify which node accounts for most latency before optimising — don't guess.
🪙

Web3 & Binance 专项

Binance 面试必定会问你对 crypto 和 Web3 的基础理解——即使是技术岗也不例外。不需要成为专家,展示真诚的好奇心即可。

已查看 0 / 10
↓ 按可能性从高到低排序 — 越靠上越重要
  • Core: World's largest crypto exchange by trading volume. Serves 300M+ users in 100+ countries.
  • Products: Spot trading, futures/derivatives, staking, Binance Earn (yield products), NFT marketplace, Binance Pay (payments), Binance Academy (education), Trust Wallet.
  • BNB Chain: Binance's own blockchain. BNB is the native token — used for trading fee discounts, gas on BNB Chain, and quarterly burn mechanism.
  • AI relevance: Binance is actively investing in AI for fraud detection, risk management, trading analytics, and user-facing AI assistants — directly relevant to the Tech Seeds 2026 program.
💡 Show genuine interest: Pick one product you actually tried — "I explored Binance Earn last week to understand the yield products" is worth 10× more than just listing things you Googled.
  • Problem: Traditional finance requires a trusted intermediary (bank, clearinghouse) to verify and record transactions — creating centralisation risk, high fees, and exclusion for the unbanked.
  • Solution: A blockchain is a distributed ledger replicated across thousands of nodes. No single entity controls it.
  • How it works: Transactions are grouped into blocks. Each block contains a cryptographic hash of the previous block, chaining them together. Altering any block invalidates all subsequent blocks — making tampering computationally infeasible.
  • Consensus: Nodes agree on the valid chain via PoW or PoS — replacing the need for a trusted central authority.
  • Proof of Work (PoW): Miners compete to solve a computationally hard puzzle. Winner adds the next block. Security comes from the energy cost of attacking. Extremely energy-intensive. Used by Bitcoin.
  • Proof of Stake (PoS): Validators chosen proportional to crypto staked as collateral. Bad behaviour means stake is "slashed". Security comes from economic loss of attacking. Much more energy-efficient. Used by Ethereum (post-merge).
  • Binance relevance: BNB Chain uses Proof of Staked Authority (PoSA) — a PoS variant with a smaller validator set for higher throughput, suitable for high-frequency trading and DApps.
  • Definition: Code that runs on a blockchain. Once deployed, it executes automatically when conditions are met — no intermediary needed. Immutable and transparent.
  • What they enable: DeFi protocols (lending/trading without a bank), NFTs (digital ownership records), DAOs (token-holder governance), stablecoins.
  • Simple example: An escrow smart contract holds funds until both parties confirm delivery. No lawyers, no banks — just code enforcing the agreement.
  • Risk: Bugs in smart contracts are permanent and can be exploited. There's no "call customer service" to reverse a bad transaction.
  • DeFi: Financial services running on smart contracts — no company owns or operates them. Anyone can use them permissionlessly. Examples: Uniswap (DEX), Aave (lending).
  • CeFi (like Binance): A company operates the exchange, holds custody of user funds, handles KYC/AML compliance, and provides customer support.
  • Trade-offs: DeFi is permissionless with self-custody but harder UX and smart contract risk. CeFi offers better UX, deeper liquidity, customer support — but requires trust in the company.
  • Binance's position: Primarily CeFi, but also operates BNB Chain ecosystem to capture DeFi users.
  • Web1: Read-only. Static pages. Users consume content.
  • Web2: Read-write. Social platforms (Facebook, YouTube). Users create content but platforms own the data and monetise it. Centralised control.
  • Web3: Read-write-own. Users own their data, digital assets, and identities via blockchain. Value accrues to users, not intermediaries.
  • Key primitives: Digital wallets (identity), NFTs (digital ownership), tokens (incentive alignment), smart contracts (trustless transactions), DAOs (decentralised governance).
  • Trading agents: AI that monitors on-chain data, news sentiment, and technical signals to execute trades autonomously.
  • Risk monitoring: Agents scanning DeFi protocols for anomalous behaviour (liquidity drains, unusual large withdrawals) and auto-alerting risk teams.
  • Customer support: AI that reads a user's wallet history and answers "why did my transaction fail?" with specific on-chain context.
  • Compliance automation: Agents monitoring transactions for AML/CFT patterns and flagging suspicious activity for human review.
  • Research synthesis: Your ASX chatbot concept applied to crypto — agents synthesising on-chain data, news, and fundamentals for retail investors.
  • What: Crypto assets designed to maintain a stable value, usually pegged 1:1 to USD. Examples: USDT (Tether), USDC, DAI (algorithmic).
  • Why important: Safe haven during crypto volatility; DeFi primitive for lending/yield farming/liquidity pools; faster and cheaper cross-border payments than SWIFT.
  • Binance angle: BUSD was Binance's own stablecoin until regulatory pressure in 2023 led to its discontinuation — shows that stablecoin regulation is a live, evolving issue.
  • Risk: Algorithmic stablecoins (UST/Luna) can collapse — the 2022 Terra crash wiped $40B+.
  • Definition: The economic model of a crypto token — how it's created, distributed, used, and what gives it value.
  • Key components: Total supply (fixed like Bitcoin's 21M, or inflationary), distribution (team / investors / public sale / ecosystem fund), utility (gas, governance, fee discounts), burn mechanisms that reduce supply over time.
  • BNB example: BNB used for trading fee discounts on Binance, gas on BNB Chain. Binance burns BNB quarterly based on trading volume — a deflationary mechanism.
  • Why it matters: Good tokenomics aligns incentives so users, developers, and validators all benefit from the network growing.
  • Scale: 300M users means extreme query volumes. Latency and cost at that scale are very different challenges from a personal project.
  • Real-time data: Crypto markets operate 24/7. AI systems need to ingest live data with very low latency — stale data in a financial context can cause real harm.
  • Regulatory compliance: Operating in 100+ countries. AI systems touching financial decisions need audit trails, explainability, and sometimes explicit human approval loops.
  • Adversarial users: Some users will actively try to manipulate AI systems — prompt injection, gaming recommendation algorithms, etc.
  • Trust and safety: Hallucinated financial information at scale could cause real harm. Quality requirements are much higher than a typical consumer chatbot.