๐ฆ UniversityDataHub - Example of Data Deliverables
UniversityDataHub example for Lion State University
๐ง 1. Data Definition Language (DDL) Scripts
Complete and normalized SQL DDL scripts for all core, lookup, and relationship tables. Includes:
- dim_person, dim_course, dim_term, dim_department, etc.
- Referential integrity via primary and foreign keys
- Support for Azure SQL, PostgreSQL, MySQL, etc
๐ 2. JSON Payload Structures
Standardized JSON payloads for core entities, ready for API and ingestion pipelines:
{
"person_id": "a1b2c3d4",
"first_name": "Jordan",
"last_name": "Smith",
"major": "Data Science",
"term": "Fall 2025"
}
๐ 3. API Services
Deployed API endpoints via Azure Functions or AWS Lambda:
- Secure read/write access to student records
- Event-driven architecture (e.g., course registrations)
- RBAC and API Gateway support
Example Endpoints:
GET /api/students/{id}
POST /api/courses
GET /api/terms?active=true
๐ 4. Data Dictionary
Full data dictionary including:
- Field names, data types, and constraints
- Lookup values (e.g., Gender, Ethnicity)
- Exportable as CSV, Excel, or HTML
๐ 5. Entity-Relationship Diagrams (ERDs)
Detailed ERDs with shading, relationship lines, and styling for onboarding:
- Core, Lookup, and Cross-reference tables
- One-to-many and many-to-many relationships
๐ 6. YAML & OpenAPI (Swagger) Specs
YAML schemas and Swagger (OpenAPI 3.0) definitions for each domain:
- Autogenerated Swagger UI
- Integration-ready with Postman and Azure API Management
๐งช 7. Test Data & Load Scripts
Realistic sample data sets and bulk loaders:
- 1,000+ rows per domain
- Performance/load testing
- QA/UAT validation
๐ ๏ธ 8. ETL/ELT Pipelines & Metadata
Sample pipelines delivered via:
- Azure Data Factory templates
- Databricks Notebooks
- Source-to-target mappings and lineage
๐ 9. Security, Identity, and Role Models
Role-Based Access Control (RBAC) structure tailored to the specific institution
- Role types, permissions, and user mappings
- Secure access based on
dim_user_account
and dim_role_type