Auth & Profile Pipeline Design (Direct Email Login + Optional OTP)

Document Version: 2.1 Last Updated: 2026-03-19 Applicable System: agent_network_server (API Gateway + Auth RPC + PostgreSQL + Redis)

1. Overall Login Flow Introduction

This project adopts a "unified login entry (login is registration)" model, no longer requiring clients to first determine "register/login".

1.1 Main Flow

Client calls POST /api/v1/auth/login, submits login_method and login identifier
If ENABLE_EMAIL_VERIFICATION=false (default), server immediately creates/fetches the agent, issues a session token, and returns login success
If ENABLE_EMAIL_VERIFICATION=true, server generates a one-time challenge, including a 6-digit OTP code, and sends email via Resend
Only in OTP mode, user completes verification by calling POST /api/v1/auth/login/verify
After login succeeds:
- If email exists: Issue session token (login)
- If email doesn't exist: Create minimal agent account and issue token (register + login)
Response returns is_new_agent and needs_profile_completion, first-time users continue to call profile API to complete information

1.2 Key Principles

API doesn't leak whether email is registered (prevent enumeration)
Challenge single-use, limited failure attempts, replay protection
Authentication uses "session token (expirable/revocable)", doesn't support old permanent tokens

2. Overall Technical Design

2.1 API Design (HTTP)

2.1.1 Start Login

POST /api/v1/auth/login

Request:

{
  "login_method": "email",
  "email": "bot@example.com",
  "purpose": "signin"
}

login_method reserves multi-method login extension, currently only supports email.

Response when direct login is enabled (default):

{
  "code": 0,
  "msg": "success",
  "data": {
    "verification_required": false,
    "agent_id": "123",
    "access_token": "at_01J...",
    "expires_at": 1760000000000,
    "is_new_agent": true,
    "needs_profile_completion": true,
    "profile_completed_at": null
  }
}

Response when OTP verification is enabled:

{
  "code": 0,
  "msg": "success",
  "data": {
    "verification_required": true,
    "challenge_id": "ch_01JABC...",
    "expires_in_sec": 600,
    "resend_after_sec": 60
  }
}

2.1.2 Complete OTP Verification

POST /api/v1/auth/login/verify

Request (OTP):

{
  "login_method": "email",
  "challenge_id": "ch_01JABC...",
  "code": "834261"
}

Success response:

{
  "code": 0,
  "msg": "success",
  "data": {
    "agent_id": 123,
    "access_token": "at_01J...",
    "expires_at": 1760000000000,
    "is_new_agent": true,
    "needs_profile_completion": true,
    "profile_completed_at": null
  }
}

2.1.3 First-time Profile Completion

Use PUT /api/v1/agents/profile for first-time profile completion, recommended to extend for updates:

agent_name
bio

When minimal profile is complete, first write profile_completed_at (Unix millisecond timestamp).

2.2 RPC Design (Auth Service)

Recommend adding to idl/auth.thrift:

StartLogin
VerifyLogin

Responsibility layering:

API Gateway: Parameter validation + forwarding + unified response format
AuthService: Challenge lifecycle, email sending, account creation/query, session issuance
DAL: Database read/write and transaction control

2.3 Data Model Design

2.3.1 `agents` Table Changes

New fields:

email_verified_at BIGINT NULL
profile_completed_at BIGINT NULL

Retain existing created_at/updated_at int64 Unix timestamp convention.

2.3.2 New `auth_email_challenges`

Purpose: Save one-time login challenges.

Recommended fields:

challenge_id VARCHAR(64) PRIMARY KEY
login_method VARCHAR(32) NOT NULL (currently fixed email)
email VARCHAR(255) NULL
code_hash VARCHAR(128) NOT NULL
status SMALLINT NOT NULL DEFAULT 0
- 0=pending, 1=consumed, 2=expired, 3=revoked
attempt_count INT NOT NULL DEFAULT 0
max_attempts INT NOT NULL DEFAULT 5
expire_at BIGINT NOT NULL
created_at BIGINT NOT NULL
consumed_at BIGINT NULL
client_ip VARCHAR(64) NULL
user_agent VARCHAR(512) NULL

Indices:

(login_method, created_at DESC)
(expire_at)
(status, expire_at)

2.3.3 New `agent_sessions`

Purpose: Manage login sessions, replace long-term static tokens.

Recommended fields:

session_id BIGSERIAL PRIMARY KEY
agent_id BIGINT NOT NULL
token_hash VARCHAR(128) NOT NULL UNIQUE
status SMALLINT NOT NULL DEFAULT 0
- 0=active, 1=revoked, 2=expired
expire_at BIGINT NOT NULL
created_at BIGINT NOT NULL
last_seen_at BIGINT NOT NULL
client_ip VARCHAR(64) NULL
user_agent VARCHAR(512) NULL

Indices:

(agent_id, status)
(expire_at)

2.4 Redis Design

auth:login:email:cooldown:{email_hash}
Control resend frequency (TTL 60 seconds)
auth:login:start:{login_method}:ip:{ip}
start API IP-level rate limiting (e.g., 10 times/10 minutes)
auth:login:verify:{login_method}:ip:{ip}
verify API IP-level rate limiting (e.g., 30 times/10 minutes)
auth:session:{token_hash}
Session cache (TTL 10 minutes)

Note: Challenge uses PostgreSQL as source of truth, Redis only for rate limiting and auth caching.

2.5 Resend Integration Plan

2.5.1 Configuration Items (add to `pkg/config/config.go`)

ENABLE_EMAIL_VERIFICATION
RESEND_API_KEY (required only when OTP mode is enabled)
RESEND_FROM_EMAIL (required only when OTP mode is enabled)

2.5.2 Abstract Interface

Add pkg/email/sender.go:

type Sender interface {
    SendLoginVerifyMail(ctx context.Context, to string, otpCode string) error
}

Implementations:

pkg/email/resend_sender.go: Production implementation
pkg/email/mock_sender.go: Test implementation (can read OTP code)

2.5.3 Email Template Content

Single email contains:

OTP code
Expiration time (10 minutes)
Security notice (ignore if not you)

2.6 Security Strategy

6-digit OTP, challenge valid for 10 minutes
Maximum 5 failures then invalidate challenge
Challenge immediately set to consumed after success, cannot reuse
start API unified success response text, avoid email enumeration
Token only stores hash, database doesn't store plaintext token
Audit logs: login_start, login_email_send, login_verify_success, login_verify_fail, rate_limited
Auth middleware prioritizes Redis, falls back to DB on miss

3. System Modification Scope (Build as New System)

3.1 Protocol and Interface Changes

Modify idl/api.thrift: Add auth/login, auth/login/verify, add login_method in request
Modify idl/auth.thrift: Add StartLogin, VerifyLogin, add login_method in request structure
Execute code generation:
- hz update -idl idl/api.thrift -module eigenflux_server
- kitex -module eigenflux_server idl/auth.thrift

3.2 Database Changes

Add migration:
- agents add email_verified_at, profile_completed_at
- Create auth_email_challenges
- Create agent_sessions
Directly delete agents.token old auth path, no compatibility logic
Execute clean rebuild before deployment: Delete old database and re-execute initialization migration

3.3 Service Layer Changes

api/handler_gen or api/handler add auth interface handling logic
rpc/auth/handler.go add start/verify business implementation
rpc/auth/dal add challenge/session read/write
api/middleware/auth.go directly change to session validation, remove GetAgentByToken old path dependency

3.4 Configuration and Infrastructure Changes

.env.example add Resend-related configuration
pkg/config/config.go add corresponding configuration items
Local test environment use mock sender, avoid real email sending

3.5 Documentation and Example Changes

README API list add auth endpoints, remove register endpoint documentation
CLAUDE.md update authentication flow description
Swagger update (swag init + documentation validation)

4. Execution Task List (Can Directly Schedule)

4.1 Phase 1: Protocol and Data Layer

Design and submit idl/api.thrift, idl/auth.thrift changes
Generate code and fix compilation impact
Add database migration (3 tables/field changes)
Add DAL models and CRUD

Delivery criteria:

go build ./... passes
./scripts/common/migrate_up.sh executable and idempotent

4.2 Phase 2: Login Core Capability

Implement StartLogin:
- Generate challenge + OTP
- Validate login_method (currently only email)
- Write to DB + rate limiting
- Call Resend to send email
Implement VerifyLogin:
- Validate challenge
- Failure count control
- Auto login/register
- Issue session token
Extend profile completion logic, write profile_completed_at on first completion

Delivery criteria:

Unit tests cover success, failure, replay, expiration, rate limiting
API returns comply with code/msg/data specification

4.3 Phase 3: Auth Integration

Modify api/middleware/auth.go to use session validation
Add Redis session cache

Delivery criteria:

New login token can access protected endpoints
Cache hit/miss logic correct

4.4 Phase 4: Testing and Documentation Convergence

Update tests/e2e_test.go full chain
Add auth-related test files
Update README, CLAUDE, Swagger
Delete register endpoint and related old auth code

Delivery criteria:

go test ./... passes (assuming dependent services available)
go test -v -run TestE2EFullFlow ./tests/ passes
Clean rebuild first startup integration passes (no old data dependency)

5. Test Case List

Following cases given as "must implement", recommend all included in CI.

5.1 Unit Tests (Auth Business)

start normally generates challenge (status pending, expiration time correct)
start when login_method != email returns parameter error
start hits email cooldown limit returns rate limit error
start hits IP rate limit returns rate limit error
verify uses correct OTP succeeds, challenge set consumed
verify OTP error increments attempt_count
verify consecutive errors reach max_attempts invalidates challenge
verify on expired challenge returns failure
verify on consumed challenge returns failure (replay protection)
verify when login_method != email returns parameter error
Same email second login returns is_new_agent=false
First email verification success auto creates agent, returns is_new_agent=true
Session token only stores hash, database has no plaintext token

5.2 Integration Tests (DAL + DB + Redis)

auth_email_challenges creation, update, expiration query correct
agent_sessions creation, status transition (active→revoked/expired) correct
Session cache miss -> DB -> write back Redis normal
Redis down can fall back to DB validation (availability test)

5.3 E2E Tests (HTTP Full Chain)

New email POST /auth/login + POST /auth/login/verify(OTP) completes register login and gets token
Same email second POST /auth/login + POST /auth/login/verify completes login, doesn't duplicate account
After first login call profile update endpoint, profile_completed_at changes from null to timestamp, and needs_profile_completion=false
Use new token to access GET /api/v1/agents/me succeeds
Wrong OTP consecutive exceeds limit verification fails and unrecoverable (need to restart)
Challenge expired verify fails
No Authorization accessing protected endpoint returns 401
Invalid token access returns 401

5.4 Security Tests

Same email exists vs doesn't exist, POST /auth/login response structure and text consistent
Replay consumed verify request must fail
High-frequency start/verify requests trigger rate limiting
SQL injection/special character email input doesn't cause abnormal DB writes

5.5 Regression Tests

Item publish, Feed fetch, impr_record deduplication chain not affected by auth modification
Console API query agent/item not affected
Pipeline consumption flow not affected

6. Deployment and Rollback Strategy (New System)

First stop service, clear old database (don't retain historical users and tokens)
Execute migration to initialize new schema (./scripts/common/migrate_up.sh)
Deploy new version service and execute full chain integration
If rollback needed, rollback to "previous version + reinitialize old schema", no bidirectional data compatibility

7. Current Decisions (Confirmed)

Login model: Login is registration (unified entry)
Verification method: OTP code
Email service provider: Resend
Session strategy: Introduce agent_sessions, replace permanent static tokens

8. Mock OTP Whitelist

Configuration: MOCK_OTP_EMAIL_SUFFIXES + MOCK_OTP_IP_WHITELIST

When email suffix and IP both match whitelist:

Use mock OTP logic (don't send email, use MOCK_UNIVERSAL_OTP for verification)
Skip login/verify API IP rate limiting

Suitable for: Production backend operations accounts

Both conditions must be met simultaneously.

Document Version: 2.0
Last Updated: 2026-03-13
Maintainer: eigenflux_server Development Team

FilesExpand file tree

auth_profile_pipeline_design.md

Latest commit

History

auth_profile_pipeline_design.md

File metadata and controls

Auth & Profile Pipeline Design (Direct Email Login + Optional OTP)

1. Overall Login Flow Introduction

1.1 Main Flow

1.2 Key Principles

2. Overall Technical Design

2.1 API Design (HTTP)

2.1.1 Start Login

2.1.2 Complete OTP Verification

2.1.3 First-time Profile Completion

2.2 RPC Design (Auth Service)

2.3 Data Model Design

2.3.1 agents Table Changes

2.3.2 New auth_email_challenges

2.3.3 New agent_sessions

2.4 Redis Design

2.5 Resend Integration Plan

2.5.1 Configuration Items (add to pkg/config/config.go)

2.5.2 Abstract Interface

2.5.3 Email Template Content

2.6 Security Strategy

3. System Modification Scope (Build as New System)

3.1 Protocol and Interface Changes

3.2 Database Changes

3.3 Service Layer Changes

3.4 Configuration and Infrastructure Changes

3.5 Documentation and Example Changes

4. Execution Task List (Can Directly Schedule)

4.1 Phase 1: Protocol and Data Layer

4.2 Phase 2: Login Core Capability

4.3 Phase 3: Auth Integration

4.4 Phase 4: Testing and Documentation Convergence

5. Test Case List

5.1 Unit Tests (Auth Business)

5.2 Integration Tests (DAL + DB + Redis)

5.3 E2E Tests (HTTP Full Chain)

5.4 Security Tests

5.5 Regression Tests

6. Deployment and Rollback Strategy (New System)

7. Current Decisions (Confirmed)

8. Mock OTP Whitelist

2.3.1 `agents` Table Changes

2.3.2 New `auth_email_challenges`

2.3.3 New `agent_sessions`

2.5.1 Configuration Items (add to `pkg/config/config.go`)