Skip to content

Conversation

@franciscojavierarceo
Copy link
Member

@franciscojavierarceo franciscojavierarceo commented Dec 19, 2025

What this PR does / why we need it:

Improves speed of embedding transformation so static artifacts are loaded at build time.

Enables loading models, lookup tables, and other static resources once at server startup instead of per-request, improving performance for real-time inference.

Changes

  • Core: Adds load_static_artifacts() function to feature server with FastAPI lifespan integration
  • Template: Updated PyTorch NLP template with static_artifacts.py module and pre-loaded artifact usage
  • Tests: Added 6 comprehensive unit tests covering success, error handling, and persistence
  • Docs: Added static artifacts section to Python feature server documentation

Which issue(s) this PR fixes:

N/A

Misc

…Feature Transformations

Signed-off-by: Francisco Javier Arceo <[email protected]>
…Feature Transformations

Signed-off-by: Francisco Javier Arceo <[email protected]>
…Feature Transformations

Signed-off-by: Francisco Javier Arceo <[email protected]>
Signed-off-by: Francisco Javier Arceo <[email protected]>
Copy link
Collaborator

@shuchu shuchu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

Copy link
Member

@ntkathole ntkathole left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good 👍

franciscojavierarceo and others added 3 commits December 22, 2025 11:35
Signed-off-by: Francisco Javier Arceo <[email protected]>
Signed-off-by: Francisco Javier Arceo <[email protected]>
@franciscojavierarceo
Copy link
Member Author

CI was passing before the AWS issue. Going to merge.

@franciscojavierarceo franciscojavierarceo merged commit edefc3f into master Dec 23, 2025
19 of 20 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants