TTB Label Verification System

A web application that simulates TTB (Alcohol and Tobacco Tax and Trade Bureau) label verification by comparing form inputs with OCR-extracted text from alcohol label images.

🚀 Live Demo

Production URL: https://ttb-pied.vercel.app/

📋 Overview

This system helps verify that alcohol label information matches TTB application form data by:

Extracting text from uploaded label images using OCR (Tesseract.js)
Comparing extracted information with form inputs
Providing detailed match/mismatch reporting
Checking for required government warning text

📁 Project Structure

ttb/
├── src/
│   ├── app/                  # Next.js app router pages and API routes
│   │   ├── api/ocr/         # API endpoints for OCR processing
│   │   ├── components/       # React components
│   │   ├── lib/             # Business logic (verification)
│   │   ├── types/           # TypeScript type definitions
│   │   ├── utils/           # Utility functions (OCR, text processing)
│   │   └── __tests__/       # Integration tests
│   ├── components/__tests__/ # Component unit tests
│   ├── lib/__tests__/       # Library unit tests
│   └── utils/__tests__/     # Utility unit tests
├── docs/                    # Documentation
│   ├── ARCHITECTURE.md      # System architecture details
│   ├── TESTING.md           # Comprehensive testing guide
│   ├── CR.md                # Code review documentation
│   └── HIGH.md              # High-level requirements
├── public/                  # Static assets
└── testimages/              # Test image assets

✨ Features

TTB Form Interface - Complete form with brand name, product class, alcohol content, and net contents
Drag-and-Drop Image Upload - Easy image upload with preview functionality
Triple OCR Support - Choose between Tesseract.js (client-side), Google Cloud Vision API (server-side), or Google AI Studio (Gemini AI)
Intelligent Verification - Fuzzy matching with tolerance for OCR errors
Detailed Results - Comprehensive reporting with visual indicators
Error Handling - Graceful handling of invalid images and processing failures

🛠️ Technology Stack

Frontend: Next.js 16 with React 19 and TypeScript
Styling: Tailwind CSS v4
OCR: Triple provider support
- Tesseract.js (client-side OCR with WebAssembly)
- Google Cloud Vision API (server-side via API routes)
- Google AI Studio (Gemini AI via direct API calls)
Testing: Jest with React Testing Library, 80%+ code coverage
Deployment: Vercel
File Handling: Native File API with type validation

🚀 Quick Start

Prerequisites

Node.js 18+
npm, yarn, pnpm, or bun

Installation

Clone the repository

git clone https://github.com/chasekb/ttb.git
cd ttb

Install dependencies

npm install
# or
yarn install
# or
pnpm install

Start development server

npm run dev
# or
yarn dev
# or
pnpm dev

Open your browser Navigate to http://localhost:3000

Build for Production

npm run build
npm start

📖 Usage Guide

Step 1: Fill Out TTB Form

Brand Name: Enter the exact brand name from your TTB application
Product Class/Type: Select from dropdown (Bourbon, Vodka, IPA, etc.)
Alcohol Content (ABV): Enter percentage (0-100%)
Net Contents: Optional volume information (e.g., "750 mL", "12 fl oz")
OCR Provider: Choose between Tesseract.js (client-side), Google Cloud Vision API (server-side), or Google AI Studio (Gemini AI)

Step 2: Upload Label Image

Drag and drop an image file or click to browse
Supported formats: JPEG, PNG, GIF, WebP
File size limits vary by OCR provider:
- Tesseract.js: Up to 50MB
- Google Cloud Vision API: Up to 20MB
- Google AI Studio: Up to 20MB
Image should be clear and readable for best OCR results

Step 3: Review Results

✅ Verification Passed: All information matches the label
❌ Verification Failed: Issues found with specific details
Detailed breakdown shows match status for each field

🔧 API Documentation

Components

`TTBForm`

interface TTBFormProps {
  onSubmit: (data: TTBFormData) => void;
  isLoading?: boolean;
}

`ImageUpload`

interface ImageUploadProps {
  onImageSelect: (file: File) => void;
  isLoading?: boolean;
}

`ResultsDisplay`

interface ResultsDisplayProps {
  result: VerificationResult;
  onRetry: () => void;
}

Types

`TTBFormData`

interface TTBFormData {
  brandName: string;
  productClass: string;
  alcoholContent: number;
  netContents?: string;
  ocrProvider?: OCRProvider;
}

`OCRProvider`

type OCRProvider = 'tesseract' | 'google-cloud-vision' | 'google-ai-studio';

`VerificationResult`

interface VerificationResult {
  brandName: { match: boolean; extracted: string; expected: string };
  productClass: { match: boolean; extracted: string; expected: string };
  alcoholContent: { match: boolean; extracted: number; expected: number };
  netContents?: { match: boolean; extracted: string; expected: string };
  governmentWarning: { found: boolean; text?: string };
  overallMatch: boolean;
}

🔍 Verification Logic

Matching Criteria

Brand Name: Case-insensitive fuzzy matching
Product Class: Fuzzy matching with variations (e.g., "Kentucky Straight Bourbon" vs "Bourbon")
Alcohol Content: Within ±0.1% tolerance
Net Contents: Fuzzy matching for volume text
Government Warning: Must contain required warning text

Text Processing

The system uses intelligent text processing to handle OCR variations:

Normalization: Removes punctuation and converts to lowercase
Pattern Matching: Recognizes alcohol percentages and volume measurements
Fuzzy Matching: Handles OCR errors and text variations

⚠️ Known Limitations

OCR Accuracy

OCR accuracy depends on image quality and text clarity
Handwritten text may not be recognized accurately
Low-resolution images may produce poor results

Text Extraction

Complex label layouts may confuse text extraction
Stylized fonts may not be recognized properly
Background patterns can interfere with text recognition

Verification Logic

Fuzzy matching may produce false positives
Government warning detection relies on keyword matching
Product class variations may not be comprehensive

Browser Compatibility

Requires modern browsers with WebAssembly support
Large images may cause performance issues on mobile devices
OCR processing is CPU-intensive and may be slow on older devices

🚀 Deployment

Vercel Deployment

Connect to Vercel
```
npx vercel
```
Deploy to Production
```
npx vercel --prod
```

Environment Variables

For Tesseract.js OCR (Client-Side)

No environment variables are required. Tesseract runs locally in the browser with WebAssembly support.

For Google Cloud Vision API

Create a .env.local file with your Google Cloud credentials:

# Google Cloud Project ID
GOOGLE_CLOUD_PROJECT_ID=your-project-id

# Google Cloud Service Account Email
GOOGLE_CLOUD_CLIENT_EMAIL=your-service-account@your-project.iam.gserviceaccount.com

# Google Cloud Private Key (replace \n with actual newlines)
GOOGLE_CLOUD_PRIVATE_KEY="-----BEGIN PRIVATE KEY-----\nYOUR_PRIVATE_KEY_HERE\n-----END PRIVATE KEY-----\n"

For Google AI Studio (Gemini)

Create a .env.local file with your Google AI Studio API key:

# Google AI Studio API Key
GOOGLE_AI_API_KEY=your-api-key-here

Setting up Google AI Studio

Go to Google AI Studio
- Visit https://aistudio.google.com/
Create API Key
- Click on "Get API key" in the left sidebar
- Create a new API key or use an existing one
Copy API Key
- Copy the generated API key
- Add it to your .env.local file as GOOGLE_AI_API_KEY

Setting up Google Cloud Vision API

Go to Google Cloud Console
- Visit https://console.cloud.google.com/
Create or Select Project
- Create a new project or select an existing one
Enable Vision API
- Navigate to "APIs & Services" > "Library"
- Search for "Cloud Vision API" and enable it
Create Service Account
- Go to "IAM & Admin" > "Service Accounts"
- Click "Create Service Account"
- Give it a name and description
- Grant "Cloud Vision API User" role
Download Credentials
- Click on the service account
- Go to "Keys" tab
- Click "Add Key" > "Create new key" > "JSON"
- Download the JSON file
Extract Credentials
- Open the downloaded JSON file
- Copy project_id, client_email, and private_key
- Add them to your .env.local file

🧪 Testing

This project includes comprehensive testing with Jest and React Testing Library. The test suite covers unit tests, component tests, integration tests, and end-to-end workflow tests.

Running Tests

# Run all tests
npm test

# Run tests in watch mode
npm run test:watch

# Generate coverage report
npm run test:coverage

# Run tests in CI mode
npm run test:ci

Test Coverage

Current test coverage: ~53% overall (target: 80%+)

Coverage Breakdown:

Unit Tests: Utility functions and business logic ✅
Component Tests: React component behavior ✅
Integration Tests: OCR provider integration ✅
API Route Tests: Server-side endpoints 📋 (planned improvements needed)

Test Structure

src/
├── __tests__/
│   ├── accessibility.test.tsx    # A11y testing with axe-core
│   ├── integration.test.tsx      # OCR provider integration
│   └── performance.test.tsx      # Performance benchmarks
├── components/__tests__/
│   ├── ImageUpload.test.tsx
│   ├── ResultsDisplay.test.tsx
│   └── TTBForm.test.tsx
├── utils/__tests__/
│   ├── ocr.test.ts
│   └── textProcessing.test.ts
├── lib/__tests__/
│   └── verification.test.ts
└── app/api/__tests__/
    └── ocr/google-cloud-vision/

Manual Testing Checklist

Test with various label images (different formats, sizes)
Verify matching scenarios (exact matches, fuzzy matches)
Test mismatch detection (wrong brand, wrong ABV, etc.)
Validate error handling (invalid images, no text found)
Test government warning detection
Verify responsive design on different screen sizes

Test Images

For testing, use clear, high-resolution images of alcohol labels with:

Readable text
Visible alcohol percentage
Government warning text
Brand name and product type

Additional Resources

For detailed testing information, see docs/TESTING.md.

🤝 Contributing

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

Tesseract.js for OCR capabilities
Next.js for the React framework
Tailwind CSS for styling
Vercel for deployment platform

📚 Documentation

Additional Resources

docs/ARCHITECTURE.md - Detailed system architecture and design decisions
docs/TESTING.md - Comprehensive testing guide and coverage reports
docs/CR.md - Code review guidelines and standards
docs/HIGH.md - High-level system requirements and specifications

📞 Support

For questions or issues, please:

Check the Known Limitations section
Review the detailed documentation in the docs/ directory
Review existing GitHub Issues
Create a new issue with detailed information

Note: This is a demonstration system for TTB label verification. It uses OCR technology to extract text from alcohol label images and compare it with form data. For production use, additional validation and compliance checks would be required.

Name		Name	Last commit message	Last commit date
Latest commit History 64 Commits
.github/workflows		.github/workflows
docs		docs
public		public
src		src
.gitignore		.gitignore
.lighthouserc.json		.lighthouserc.json
README.md		README.md
eslint.config.mjs		eslint.config.mjs
jest.config.js		jest.config.js
jest.setup.ts		jest.setup.ts
next.config.ts		next.config.ts
package-lock.json		package-lock.json
package.json		package.json
postcss.config.mjs		postcss.config.mjs
tsconfig.json		tsconfig.json

chasekb/ttb

Folders and files

Latest commit

History

Repository files navigation

TTB Label Verification System

🚀 Live Demo

📋 Overview

📁 Project Structure

✨ Features

🛠️ Technology Stack

🚀 Quick Start

Prerequisites

Installation

Build for Production

📖 Usage Guide

Step 1: Fill Out TTB Form

Step 2: Upload Label Image

Step 3: Review Results

🔧 API Documentation

Components

TTBForm

ImageUpload

ResultsDisplay

Types

TTBFormData

OCRProvider

VerificationResult

🔍 Verification Logic

Matching Criteria

Text Processing

⚠️ Known Limitations

OCR Accuracy

Text Extraction

Verification Logic

Browser Compatibility

🚀 Deployment

Vercel Deployment

Environment Variables

For Tesseract.js OCR (Client-Side)

For Google Cloud Vision API

For Google AI Studio (Gemini)

Setting up Google AI Studio

Setting up Google Cloud Vision API

🧪 Testing

Running Tests

Test Coverage

Test Structure

Manual Testing Checklist

Test Images

Additional Resources

🤝 Contributing

📄 License

🙏 Acknowledgments

📚 Documentation

Additional Resources

📞 Support

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

`TTBForm`

`ImageUpload`

`ResultsDisplay`

`TTBFormData`

`OCRProvider`

`VerificationResult`

Packages