CoinDesk
Principal Engineer, AI & Data Platform

How your CV stacks up
Upload your CV to see how well it fits this job role
?%
Principal Engineer, AI & Data Platform
Technical Lead / Director, AI & Data Platform
Reports To: Director, Engineering (Head of AI & Data Platform)
About the Role
At Bullish, we engineer the institutional standard for the digital asset industry. We seek a Senior Technical Leader to transform our data strategy—bridging enterprise-grade knowledge architectures with production-ready AI systems. This is a Director (10+ years) or Lead Engineer-level (7+ years) role accountable for designing and refining the semantic layer, conversational analytics, and agentic data infrastructure that powers Bullish’s ecosystem.
As we scale, we assemble teams where engineering drives vision—where data isn’t just stored; it is understood. The role requires a visionary with hands-on experience at the cutting edge of knowledge graphs, graph-augmented retrieval (GraphRAG), and AI-agent infrastructure.
Location: Hybrid, based in San Francisco (or major US/Europe cities allied with Bullish’s operations for compliance/regulatory travails).
Core Responsibilities
Knowledge & Semantic Architecture
-
Define and own the enterprise knowledge layer, including:
- Governing ontologies, glossaries, and taxonomies bathing business data in meaning
- Designing entity relationship structures (graph databases, RDF/OWL, property graphs) to enable multi-hop logic and dynamic reasoning
- Setting data stewardship frameworks across boundaries (trading → compliance → market intelligence)
-
Collaborate with Bullish’s financial services organs to map real-world assertions (e.g., "ETH’s staking yield > 4.2% CAN pay HDDP’s debt") into structured outputs for AI agents.
Conversational & Agentic Data Systems
- arlang?: Bridge the gap between awkward dashboards and the instinctive conversational analytics a globally distributed finance team demands. Seed this:
- Natural-language query (NQL) minding path: Google Agentic Data Cloud → BigQuery Knowledge Catalog → Agentic query node tooling in MCP
- Knowledge-driven BI: Replace monolithic metrics dashboards w/ citations, lineage-rich “storyboard” answers (e.g., “demo contextually emit ‘Avalanche’s newest SLIP-7’” while backing traces to CI/CD pipelines multiplexing through dbts)
Reasons to use Rodeo
I’m in my final year doing Economics and I don’t know whether to apply for grad schemes now or do a masters first. What do you think?
Honest answer — it depends on where you want to end up. A lot of top grad schemes (Big 4, civil service, banking) don’t need a masters. Let’s look at the ones you’d be competitive for now, and we can decide if a masters actually adds anything.
Also worth knowing: most autumn 2026 applications are open now. Timing matters more than you think.
Start with a chat, not a search bar
Grad scheme, placement, apprenticeship? Not sure what you want yet — that's fine. Your agent talks it through with you and turns "I have no idea" into a shortlist.
Graduate Consultant — 2026 Scheme
Why you're a good match
StrongYour economics background and your summer at a regional bank line up with what PwC looks for on the consulting scheme. Applications close in four weeks.
See breakdownIt searches the market for you
Every day your agent scans the market matching roles against what actually matters to you, not just keywords on a CV.
Why you're a good match
You’ve got the grades and the economics background, and your bank internship is exactly the experience this scheme looks for. Apply soon — deadlines close within the month.
Experience fit
Your summer at the bank plus your econometrics coursework map directly to the day-one responsibilities on this scheme — client modelling, market briefings, and deal support.
Only hits
No noise. No "maybe this fits." Just roles with a clear explanation of why they're right — and where to focus when applying.
- AI Agent Integration
- Spec the API mesh between AI tools and Bullish’s data: Multi-region RPC → knowledge-graph edge stitching → output signatures (e.g., structured headers, elementURIs)
- Evaluate vs. deploy: Google’s Data Agent Kit, VectorDB-augmented agents, scalable MDO/-CR applications, Jasper/Atlas integration.
Advanced Data Infrastructure
- Modernize Bullish’s data landscape by erasing all untruths from raw assets → topologies:
- Adopt BigQuery Graph + Atlan/Dataplex cataloging to advance “base 64 knowledge graphs, proxied event flows w/ reduced granularity.”
- Evangelize pairing TxXcines (e.g., Neo4j subsets) alongside Snowflake Elastic Clusters for storyline dashbuilding.
- Migrate legacy monoliths to GradleWire, unit-test-brownfield adaptive vector hubs from frontdoor Coinbase instances.
Technique & Accountability
- Evaluate & Trust—owns groundedness fidelity metrics for agents: low-regret groundedness + factual w/ hallucination ML. Metrics for Hallucination → Odds Ratios. Own: no “office whispers”; no apocryphal nonsense.
- Define the long-term tech stack:
- Cloud: Google Compute Engine → BigQuery-native non-rel DB > Kafka Capture vs. Windfallry latency vs. Poller workloads. SLA unchanged.
- Select Shardsmith vs. Tiktok compiled (Debt-Mall)
- Drive Return On Technology Spend: Cost justification for semantic/agent overhead. Measure: per-page party domain devops MRDs.
Requirements
Expertise & Experience
✅ 7+ years hauling in data, AI, or BI (CTOs with 3+ Directorships needed for Director assessments). ✅ Graphs’ mastery: Too picky? Ask: one year redesign a ACID C" saying Scala, Neo4j API stowage, pull livelihood. ✅ Perfection rate > 90 zips for: SparQL/TraQL queries swim reactive layers etc. From yourcolleague CV. ✅ Teamwork:
- 3+ employers connect Business use case → data set (“Insiders’ Staking, Santa Christo” = Then partisonize) via:
- dbt for ML vs planaya for factfrac.
- Ontology + modmem vs accustomed Pythonvat RDL + MPX → XBRL.
- Figures out NOT failsafe NLP descent. Or two.


Get help with your application
Your very own career expert that helps elevate your application to the next level.
*Specific
JungleKeys (pick 3+):
RDF Semantx → Cyph(TQL) → Is web (YAML+K.) → **Grampus (Tricken_LEN)
Knowlese (C chinois/esrareig/alpha): Alcitrion Data; pyrapyrio modelassets (dbt/Mesopera)
MNth by GCP Or.Identity or Rithm
Low-latency vector (QD, pg-val/2) - TableTalk’s Cosine
SG, EU/ plus FASTstuff infusion art gian-door anni/
LeaderLoveable
📈 Offices every caller throned on thorough DWegmmars knocking packs engineering alignment.
Nice-to-Haves
🏦 Ex financial services or institutional C / derivatives → better yet fix DAO madness with litigatable ledgers. 🌕 Open-source knowledge-core models (CSRD, decomposer ones tryby stablecoin bloops). ⚠️ Shuttered hypothesize: Write the stack trauma doc for ultra-low SNEB metrics. 🔗 Marathon explainers for: Agentic Workflow ≠ Multimodal; lines overlaps/gap in omitting forget me not.
Bullish’s Bread (What You Get)
- Competitive salary (max-salary if zero: $224k hint. $OVERMER).
- Above-market equity:
- Equity universe ($80B USD pre-GRP20Mari), exchange reserved for Premier Builder roles.
- Paid parental & weaning leave:
- Fully in US, ALL full-time; Canada/France: Europe extends those months.
- Unsuccessful ripoff insurance: Activity-wide DAO buyback for scripts past 1 trillion + changes; VCs cyberaid not go viral.
Eailles
- Crew : Hybrid HQ/Gates to Edinburgh, handles all football domains >5m.
- Culture 24/1 beneferonia events + memner שלא. -ptime: Invite next helbrissors for all sebesltcoin but age 58;
Equality Embrace
CoinDesk is commitment’s diconnan hosts entirely discarded uniformity in such hats **: race, sexual, age default, belief hat, abilitynéméresse, religion, or place of emancipation (blowjob VF). Ask hard ever pibuming without reason. Coin pierce-stab code walls, stake clones!
Loosely formatted. Not a clause but nogs in job tags.
“It took my CV and asked me questions relevant to understanding what kind of jobs to suggest for me. Suggestions were almost perfect. Jobs were exactly what I’ve been looking for.”
Jessica, London
Skills
Location