Box.com Discovery Bates Namer: Cloud Integration

Migration from Desktop Tool to Serverless AWS Workflow

Project Sponsor:
Deepak Budwani, Chief Financial and Administrative Officer – Office of the Public Defender, Santa Barbara County

Project Managers and Primary Project Contact:
AJ Voisan

Project Team and Subject Matter Experts:
Angie Stokke, Sarah Rothschild, Luis Ramirez, and Jalena Jones

Description

As the Office of the Public Defender continues its transition to a fully paperless case management system, we have adopted Box.com as our secure, CJIS- and HIPAA-compliant cloud content management solution. This platform allows for efficient digital file storage, internal collaboration, workflow automation, and integration with external systems like our case management software, eDefender.

With monthly ingestion of electronically stored information (ESI) running at 5–6 terabytes or more, automating file organization and metadata handling has become critical. One key project this term focuses on automating the discovery file workflow: parsing uploaded PDFs for Bates stamps, renaming the files to match, and moving them to the appropriate locations in Box based on their case number.
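
To make the renaming step concrete, here is a minimal Python sketch of how a Bates range could be pulled from a PDF's page text and turned into a file name. It is an illustration, not the project's actual code: the stamp format (an uppercase prefix followed by six or more digits, such as `SBPD000123`), the `FIRST-LAST.pdf` naming scheme, and the function names `bates_range` and `bates_filename` are all assumptions. The page text is assumed to have been extracted already by a PDF library.

```python
import re
from pathlib import PurePosixPath

# Assumed stamp format: 2-10 uppercase letters, an optional separator,
# then at least six digits (for example "SBPD000123"). Hypothetical.
BATES_RE = re.compile(r"\b([A-Z]{2,10})[-_ ]?(\d{6,})\b")


def bates_range(page_texts):
    """Return (first_stamp, last_stamp) found across all pages, or None.

    Assumes every stamp in one file shares the same prefix, so stamps
    can be ordered by their numeric part alone.
    """
    stamps = []
    for text in page_texts:
        stamps.extend(BATES_RE.findall(text))
    if not stamps:
        return None
    first = min(stamps, key=lambda s: int(s[1]))
    last = max(stamps, key=lambda s: int(s[1]))
    return "".join(first), "".join(last)


def bates_filename(original_name, page_texts):
    """Build a Bates-based file name, keeping the original extension.

    Falls back to the original name when no stamp is found.
    """
    found = bates_range(page_texts)
    if found is None:
        return original_name
    first, last = found
    label = first if first == last else f"{first}-{last}"
    return f"{label}{PurePosixPath(original_name).suffix}"
```

For example, a three-page upload named `scan.pdf` whose pages carry `SBPD000123` through `SBPD000125` would be renamed `SBPD000123-SBPD000125.pdf`, while a file with no recognizable stamp keeps its original name so it can be flagged for review.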

In a prior phase of this project, a standalone desktop application was developed to assist with Bates parsing and file renaming. However, the manual workflow (downloading files, running the tool locally, and re-uploading the renamed documents to Box) was inefficient and time-consuming. To address this, our team migrated the solution into a fully serverless cloud environment using AWS Lambda.

Now, when users upload discovery materials into a designated folder in Box, the system is triggered via webhook. AWS Lambda functions validate the input, extract Bates stamp ranges from the file content, rename the files accordingly, and move them to the appropriate destination folders. The system handles errors gracefully, distinguishing between soft and critical failures, and notifies the uploader via email if any issues occur.
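
The flow described above can be sketched as a single Lambda entry point. This is a simplified illustration under stated assumptions, not the deployed code. It assumes the webhook arrives through an HTTP proxy integration with the JSON payload as a string in `event["body"]`, and that the payload carries Box's `trigger` and `source` fields. The exception classes, the `validate_upload` helper, and the injected `process` and `notify` callables (stand-ins for the rename-and-move step and the email notification) are hypothetical names.

```python
import json


class SoftFailure(Exception):
    """Recoverable issue: the file is skipped and the uploader is told why."""


class CriticalFailure(Exception):
    """Unexpected issue: processing stops and the uploader is alerted."""


def validate_upload(payload):
    """Check the webhook payload and return (file_id, file_name)."""
    if payload.get("trigger") != "FILE.UPLOADED":
        raise SoftFailure("event ignored: not a file upload")
    source = payload.get("source") or {}
    if source.get("type") != "file" or not source.get("id"):
        raise CriticalFailure("webhook payload has no file reference")
    name = source.get("name", "")
    if not name.lower().endswith(".pdf"):
        raise SoftFailure(f"{name!r} is not a PDF and was left unchanged")
    return source["id"], name


def lambda_handler(event, context, process=lambda file_id, name: None, notify=print):
    """Validate the upload, run processing, and report failures by severity.

    `process` and `notify` are hypothetical hooks; the defaults do nothing
    and print, respectively, so the sketch runs without external services.
    """
    try:
        payload = json.loads(event.get("body") or "{}")
        file_id, name = validate_upload(payload)
        process(file_id, name)
        body = {"status": "processed", "file": name}
        return {"statusCode": 200, "body": json.dumps(body)}
    except SoftFailure as exc:
        notify(f"soft failure: {exc}")
        body = {"status": "skipped", "reason": str(exc)}
        return {"statusCode": 200, "body": json.dumps(body)}
    except (CriticalFailure, json.JSONDecodeError) as exc:
        notify(f"critical failure: {exc}")
        body = {"status": "error", "reason": str(exc)}
        return {"statusCode": 500, "body": json.dumps(body)}
```

Separating the two exception types keeps the severity decision in one place: a soft failure still acknowledges the webhook, while a critical failure returns an error status so the problem is visible in logs and monitoring.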

This automation ensures consistent file naming and precise placement within the Box folder structure, dramatically reducing the manual effort and potential for human error in discovery processing. By eliminating the need to download, locally process, and re-upload files, the system saves time and computing resources and also reduces risk: fewer file transfers between systems means less exposure to potential man-in-the-middle attacks. The resulting serverless, cloud-native pipeline on AWS Lambda is an end-to-end solution that is faster, more scalable, and more secure than the original desktop application. With built-in error detection, graceful fallback handling, and real-time email notifications, the system is both robust and user-friendly. Ultimately, this project streamlines a critical workflow, improves operational safety and reliability, and lays the foundation for future enhancements in automated document handling and metadata intelligence.

Role                                  Name                      E-mail
Faculty Advisor                       Jungsoo Lim               jlim34@calstatela.edu
Project Lead                          Santiago Del Rio Obando   sdelri10@calstatela.edu
Customer Liaison/Requirements Lead    Jesus Alcocer             jalcoc11@calstatela.edu
Architecture/Design Lead              Jian Verdad               jverdad@calstatela.edu
UI Lead                               Alexander Mendez          amende140@calstatela.edu
Backend Lead                          Justin Bradshaw           jbradsh3@calstatela.edu
QA/QC Lead                            Yongkang Liu              yliu160@calstatela.edu
Documentation Lead                    Gilbert Perez-Sanchez     gperezs7@calstatela.edu
Demo Lead                             Anthony Diaz              adiaz338@calstatela.edu
Presentation Lead                     Anthony Gonzalez          agonza362@calstatela.edu

Meeting                         Day      Time
Weekly advisor group meeting    Friday   8:00 AM - 9:00 AM
Biweekly liaison meeting        Friday   9:00 AM - 10:00 AM
Weekly team meeting             Friday   10:00 AM - 11:00 AM

Student Team
  • Jesus Antonio Alcocer
  • Justin Bradshaw
  • Santiago Del Rio Obando
  • Anthony Diaz
  • Anthony Gonzalez
  • Yongkang Liu
  • Alexander Mendez
  • Gilbert Perez-Sanchez
  • Jian Verdad