ミドルウェア - RAIL Score

ミドルウェアは、すべての AI レスポンスをインターセプトし、アプリケーションの他の部分に届く前に RAIL スコアを付与するパターンです。AI クライアントを RAIL ラッパーに置き換えます。ラッパーはモデルを呼び出し、レスポンスを評価し、コンテンツとスコアの両方を 1 つのオブジェクトで返します。

Python SDK: インテグレーションリファレンス | API: 評価エンドポイント

解決する課題

ミドルウェアがないと、すべての AI 呼び出しに責任ある AI のチェックを追加するには、モデルを呼び出すすべての箇所に評価コードを書くことになり、ロジックの重複、カバレッジの抜け、アプリケーションコードの煩雑化を招きます:

# Eval code scattered everywhere
async def get_response(user_message):
    response = await openai_client.chat.completions.create(
        model="gpt-4o", messages=[{"role": "user", "content": user_message}]
    )
    content = response.choices[0].message.content

    # Must remember to eval in every function
    score = rail_client.eval(content=content, mode="basic")
    if score.rail_score.score < 7.0:
        raise ValueError("Response below quality threshold")

    return content

仕組み

RAIL ラッパーのメソッドを呼び出すと、3 つのことが透過的に行われます:

メッセージは通常の API 呼び出しとして、基盤となるモデルの API へ転送されます。
モデルのレスポンスは、設定したモードで RAIL 評価エンドポイントに送信されます。
元のコンテンツ、RAIL スコア、次元ごとのスコア、threshold_met のブール値をすべて 1 つの戻り値に含む、ラップされたレスポンスオブジェクトが返されます。

サポートされているプロバイダ

ラッパー	ラップ対象	Python	JavaScript
`RAILOpenAI`	OpenAI chat completions	はい	はい
`RAILGemini`	Google Gemini	はい	はい
`RAILAnthropic`	Anthropic Claude	はい	はい
`RAILLangChain`	任意の LangChain モデル	はい	—
カスタムラッパー	任意の HTTP ベースのモデル	はい	はい

観察モードとエンフォースモード

観察のみ
しきい値のエンフォース
自動再生成

すべてのレスポンスをスコアリングしますが、ブロックはしません。レスポンスのフローを中断せずに品質を測定したいときに使います。

client = RAILOpenAI(
    openai_api_key="...",
    rail_api_key="...",
    eval_mode="basic",
    # No threshold — always returns response
)

response = await client.chat(messages=[...])
print(response.content)        # The model's response
print(response.rail_score)     # RAIL score (always present)
print(response.threshold_met)  # None — no threshold configured

レスポンスが基準を満たさない場合に ThresholdError を発生させます。

client = RAILOpenAI(
    openai_api_key="...",
    rail_api_key="...",
    eval_mode="basic",
    threshold=7.0,
)

try:
    response = await client.chat(messages=[...])
    return response.content
except ThresholdError as e:
    # e.rail_score and e.failed_dimensions are available
    return fallback_response()

レスポンスがしきい値を下回ったときに、安全な再生成を自動的にトリガーします。

client = RAILOpenAI(
    openai_api_key="...",
    rail_api_key="...",
    eval_mode="basic",
    threshold=7.0,
    on_fail="regenerate",
    max_iterations=3,
)

# Returns the best content — original or regenerated
response = await client.chat(messages=[...])
print(response.content)
print(response.iterations_taken)  # 1 if original passed

カスタムミドルウェアの作成

組み込みラッパーのないモデルプロバイダを使う場合は、コアの eval() 呼び出しを使って独自のミドルウェアを構築します:

from rail_score_sdk import RailScoreClient

rail = RailScoreClient(api_key="...")

async def rail_middleware(model_call, messages, threshold=7.0):
    """Generic RAIL middleware for any async model call."""
    content = await model_call(messages)

    result = rail.eval(content=content, mode="basic")

    if result.rail_score.score < threshold:
        raise ValueError(
            f"Response scored {result.rail_score.score:.1f} — below threshold {threshold}. "
            f"Failed: {[d for d, s in result.dimension_scores.items() if s.score < threshold]}"
        )

    return content, result

# Use with any model:
content, score = await rail_middleware(my_model_call, messages, threshold=7.5)

次のステップ

コンセプト: ポリシーエンジン

セッション全体でスコアに基づきアクションを取る宣言的ルール。

Python: インテグレーション

プロバイダラッパーの完全なドキュメントとオプション。

JavaScript: プロバイダ

OpenAI、Gemini、Anthropic 用の TypeScript ラッパー。

Python: ミドルウェア SDK

RAILMiddleware — 任意のモデル関数をラップします。

​解決する課題

​仕組み

​サポートされているプロバイダ

​観察モードとエンフォースモード

​カスタムミドルウェアの作成

​次のステップ

コンセプト: ポリシーエンジン

Python: インテグレーション

JavaScript: プロバイダ

Python: ミドルウェア SDK

解決する課題

仕組み

サポートされているプロバイダ

観察モードとエンフォースモード

カスタムミドルウェアの作成

次のステップ