ADD scalar_model #292
Conversation
Summary of Changes

Hello @michaelteboul, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed! This pull request introduces a novel self-attention model, SelfAttentionModelScalar, which uses a dot product for embeddings instead of a distance.
Code Review
This pull request introduces a new SelfAttentionModelScalar for basket modeling, utilizing a dot product for embeddings instead of distance. The changes primarily involve adding this new model class, which inherits from BaseBasketModel. The implementation includes attention mechanisms, short-term and long-term utility calculations, and a custom loss function. While the overall structure is sound, several docstring inaccuracies, shape mismatches in comments, and a critical logical error in the compute_psi method's tf.einsum operations need to be addressed to ensure correctness and maintainability.
```python
if self.store_effects:
    theta_store = tf.gather(self.theta, indices=store_batch)
    # Compute the dot product along the last dimension
    store_preferences = tf.einsum("kj,klj->kl", theta_store, x_item)
```
The tf.einsum operation tf.einsum("kj,klj->kl", theta_store, x_item) is likely incorrect given the shapes of theta_store and x_item. theta_store is (batch_size, d) and x_item is (batch_size, d). To compute a batch-wise dot product, you should use tf.reduce_sum(theta_store * x_item, axis=-1) or tf.einsum("bd,bd->b", theta_store, x_item). The current einsum pattern implies x_item has an extra dimension L which it does not. This will lead to runtime errors or incorrect calculations.
```diff
- store_preferences = tf.einsum("kj,klj->kl", theta_store, x_item)
+ store_preferences = tf.reduce_sum(theta_store * x_item, axis=-1)
```
```python
price_effects = (
    -1
    # Compute the dot product along the last dimension
    * tf.einsum("kj,klj->kl", delta_store, beta_item)
```
Similar to the previous comment, the tf.einsum operation tf.einsum("kj,klj->kl", delta_store, beta_item) is incorrect. delta_store is (batch_size, latent_sizes["price"]) and beta_item is (batch_size, latent_sizes["price"]). For a batch-wise dot product, it should be tf.reduce_sum(delta_store * beta_item, axis=-1) or tf.einsum("bp,bp->b", delta_store, beta_item).
```diff
- * tf.einsum("kj,klj->kl", delta_store, beta_item)
+ * tf.reduce_sum(delta_store * beta_item, axis=-1)
```
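Both suggestions apply the same fix. As a quick sanity check, a minimal sketch (toy shapes, not the PR's code) showing that the proposed `reduce_sum` form and the `"bd,bd->b"` einsum form compute the same batch-wise dot product:

```python
import tensorflow as tf

# Made-up shapes for illustration: batch of 4 vectors of dimension 8.
theta_store = tf.random.normal((4, 8))
x_item = tf.random.normal((4, 8))

# Batch-wise dot product: one scalar per batch element -> shape (4,).
dot_reduce = tf.reduce_sum(theta_store * x_item, axis=-1)
dot_einsum = tf.einsum("bd,bd->b", theta_store, x_item)

tf.debugging.assert_near(dot_reduce, dot_einsum)  # passes: the two are equivalent
```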
```python
attention_weights = tf.ones_like(scaled_scores)  # Shape: (batch_size, L, 1)

else:
    # Masque de la diagonale, désactivé pour l'instant
    # (diagonal mask, deactivated for now)
```
The comment Masque de la diagonale, désactivé pour l'instant is in French. Please translate it to English for consistency. Also, the code scaled_scores = tf.where(diag_mask, tf.constant(-np.inf, dtype=scaled_scores.dtype), scaled_scores) does apply the diagonal mask, which contradicts the 'désactivé' (deactivated) part of the comment. Please either remove the masking code if it's truly deactivated or update the comment to reflect that it is active.
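For reference, a minimal sketch of the diagonal-masking pattern the review describes, with toy scores (names are illustrative, not the PR's code):

```python
import numpy as np
import tensorflow as tf

# Toy attention scores for baskets of L = 3 items, batch_size = 2.
scaled_scores = tf.random.normal((2, 3, 3))

# Mask the diagonal so an item does not attend to itself:
# set diagonal entries to -inf before the softmax.
diag_mask = tf.eye(3, batch_shape=[2], dtype=tf.bool)
masked_scores = tf.where(
    diag_mask, tf.constant(-np.inf, dtype=scaled_scores.dtype), scaled_scores
)
attention_weights = tf.nn.softmax(masked_scores, axis=-1)  # diagonal weights are 0
```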
```python
self,
n_items: int,
n_users: int,
n_stores: int,
```
```python
    shape=(n_stores, self.d)
),  # Dimension for 1 item: latent_sizes["preferences"]
```
```python
basket_batch: np.ndarray
    Batch of baskets (ID of items already in the baskets) (arrays) for each purchased item
    Shape must be (batch_size, max_basket_size)
store_batch: np.ndarray
    Batch of store IDs (integers) for each purchased item
    Shape must be (batch_size,)
```
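The `-1` padding convention for `basket_batch` matters later in `compute_batch_loss`, where padded entries are stripped with a ragged mask and re-padded with `n_items` (see the suggested code further down). A toy example of that transformation:

```python
import tensorflow as tf

# Toy baskets padded with -1 up to max_basket_size = 4; n_items = 10.
basket_batch = tf.constant([[3, 7, -1, -1], [2, -1, -1, -1]])
n_items = 10

# Drop the -1 padding, then re-pad with n_items (an out-of-range "no item" index).
ragged = tf.ragged.boolean_mask(basket_batch, basket_batch != -1)
padded = ragged.to_tensor(n_items)
print(padded.numpy())  # [[ 3  7]
                       #  [ 2 10]]  (padded to the longest remaining row)
```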
| """ | ||
| store_batch = tf.cast(store_batch, dtype=tf.int32) | ||
| price_batch = tf.cast(price_batch, dtype=tf.float32) | ||
| x_item = tf.gather(self.X, indices=item_batch) # Shape: (batch_size, L, d) |
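Whether `x_item` is `(batch_size, d)` or `(batch_size, L, d)` depends entirely on the rank of `item_batch`, which is the crux of the einsum comments above. A quick illustration of `tf.gather`'s behavior with toy values:

```python
import tensorflow as tf

# Toy embedding table: 10 items, d = 4.
X = tf.random.normal((10, 4))

# 1-D indices (batch_size,) -> embeddings of shape (batch_size, d).
print(tf.gather(X, indices=[1, 2, 3]).shape)                 # (3, 4)

# 2-D indices (batch_size, L) -> embeddings of shape (batch_size, L, d).
print(tf.gather(X, indices=[[1, 2], [3, 4], [5, 6]]).shape)  # (3, 2, 4)
```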
```python
    Whether to include item intercept in the model, by default True
price_effects: bool, optional
    Whether to include price effects in the model, by default True
epsilon_price: float, optional
    Epsilon value to add to prices to avoid NaN values (log(0)), by default 1e-4
```
```python
short_term_weight : float
    Weighting factor between long-term and short-term preferences.
```
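The diff does not show how this weight is applied; a plausible reading, sketched as a convex combination (the formula and tensor names here are assumptions, not confirmed by the PR):

```python
import tensorflow as tf

# Hypothetical values; the actual combination formula is not shown in the diff.
short_term_weight = 0.3
short_term_utility = tf.constant([0.5, -1.2])
long_term_utility = tf.constant([1.0, 0.4])

# Assumed convex combination of the two preference signals.
total_utility = (
    short_term_weight * short_term_utility
    + (1.0 - short_term_weight) * long_term_utility
)
```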
```python
) -> tuple[tf.Variable]:
    """Compute total loss.

    Parameters
    ----------
    item_batch: np.ndarray
        Batch of purchased item IDs (integers)
        Shape must be (batch_size,)
    basket_batch: np.ndarray
        Batch of baskets (ID of items already in the baskets) (arrays) for each purchased item
        Shape must be (batch_size, max_basket_size)
    future_batch: np.ndarray
        Batch of items to be purchased in the future (ID of items not yet in the
        basket) (arrays) for each purchased item
        Shape must be (batch_size, max_basket_size)
        Here for signature reasons, unused for this model
    store_batch: np.ndarray
        Batch of store IDs (integers) for each purchased item
        Shape must be (batch_size,)
    week_batch: np.ndarray
        Batch of week numbers (integers) for each purchased item
        Shape must be (batch_size,)
    price_batch: np.ndarray
        Batch of prices (floats) for each purchased item
        Shape must be (batch_size,)
    available_item_batch: np.ndarray
        List of availability matrices (indicating the availability (1) or not (0)
        of the products) (arrays) for each purchased item
        Shape must be (batch_size, n_items)
    user_batch: np.ndarray
        Batch of user IDs (integers) for each purchased item
        Shape must be (batch_size,)
    is_training: bool
        Whether the model is in training mode or not, to activate dropout if needed.
        True by default, because compute_batch_loss is only used during training.

    Returns
    -------
    tf.Variable
        Value of the loss for the batch (Hinge loss),
        Shape must be (1,)
    _: None
        Placeholder to match the signature of the parent class method
    """
```
The return type hint for compute_batch_loss is tuple[tf.Variable], but the function returns a tuple of two tf.Tensors (loss and loglikelihood). It should be tuple[tf.Tensor, tf.Tensor].
Additionally, the docstring states _: None for the second return value, but loglikelihood is actually returned. Please update the docstring to reflect this.
Finally, epsilon = 0.0 is used when computing loglikelihood. If tf.sigmoid(...) evaluates to 0, tf.math.log(0) will result in NaN. A small positive epsilon (e.g., 1e-8) should be used to prevent this.
```python
) -> tuple[tf.Tensor, tf.Tensor]:
    """Compute total loss.

    Parameters
    ----------
    item_batch: np.ndarray
        Batch of purchased item IDs (integers)
        Shape must be (batch_size,)
    basket_batch: np.ndarray
        Batch of baskets (ID of items already in the baskets) (arrays) for each purchased item
        Shape must be (batch_size, max_basket_size)
    future_batch: np.ndarray
        Batch of items to be purchased in the future (ID of items not yet in the
        basket) (arrays) for each purchased item
        Shape must be (batch_size, max_basket_size)
        Here for signature reasons, unused for this model
    store_batch: np.ndarray
        Batch of store IDs (integers) for each purchased item
        Shape must be (batch_size,)
    week_batch: np.ndarray
        Batch of week numbers (integers) for each purchased item
        Shape must be (batch_size,)
    price_batch: np.ndarray
        Batch of prices (floats) for each purchased item
        Shape must be (batch_size,)
    available_item_batch: np.ndarray
        List of availability matrices (indicating the availability (1) or not (0)
        of the products) (arrays) for each purchased item
        Shape must be (batch_size, n_items)
    user_batch: np.ndarray
        Batch of user IDs (integers) for each purchased item
        Shape must be (batch_size,)
    is_training: bool
        Whether the model is in training mode or not, to activate dropout if needed.
        True by default, because compute_batch_loss is only used during training.

    Returns
    -------
    batch_loss: tf.Tensor
        Value of the loss for the batch (Hinge loss),
        Shape must be (1,)
    loglikelihood: tf.Tensor
        Computed log-likelihood of the batch of items
        Approximated by difference of utilities between positive and negative samples
        Shape must be (1,)
    """
    _ = future_batch  # Unused for this model

    batch_size = len(item_batch)
    negative_samples = tf.stack(
        [
            self.get_negative_samples(
                available_items=available_item_batch[idx],
                purchased_items=basket_batch[idx],
                next_item=item_batch[idx],
                n_samples=self.n_negative_samples,
            )
            for idx in range(batch_size)
        ],
        axis=0,
    )  # Shape: (batch_size, n_negative_samples)

    item_batch = tf.cast(item_batch, tf.int32)
    negative_samples = tf.cast(negative_samples, tf.int32)
    augmented_item_batch = tf.cast(
        tf.concat([tf.expand_dims(item_batch, axis=-1), negative_samples], axis=1),
        dtype=tf.int32,
    )  # Shape: (batch_size, 1 + n_negative_samples)

    basket_batch_ragged = tf.cast(
        tf.ragged.boolean_mask(basket_batch, basket_batch != -1),
        dtype=tf.int32,
    )
    basket_batch = basket_batch_ragged.to_tensor(self.n_items)

    augmented_price_batch = tf.gather(
        params=price_batch, indices=augmented_item_batch, batch_dims=1
    )

    all_utilities = self.compute_batch_utility(
        item_batch=augmented_item_batch,
        basket_batch=basket_batch,
        store_batch=store_batch,
        week_batch=week_batch,
        price_batch=augmented_price_batch,
        available_item_batch=available_item_batch,
        user_batch=user_batch,
        is_training=is_training,
    )  # Shape: (batch_size, 1 + n_negative_samples)

    positive_samples_utility = tf.gather(params=all_utilities, indices=[0], axis=1)
    negative_samples_utility = tf.gather(
        params=all_utilities, indices=tf.range(1, self.n_negative_samples + 1), axis=1
    )  # (batch_size, n_negative_samples)

    ridge_regularization = self.l2_regularization * tf.add_n(
        [tf.nn.l2_loss(weight) for weight in self.trainable_weights]
    )
    epsilon = 1e-8
```
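The snippet ends at the epsilon definition, before the final loss expression. As a standalone illustration (toy values, not the PR's actual loss code) of why the reviewer's small positive epsilon is needed inside the log:

```python
import tensorflow as tf

# A large negative utility gap underflows: tf.sigmoid(-200.0) is exactly 0.0
# in float32, so log(sigmoid(...)) without an epsilon would be -inf and
# poison the gradients.
utility_diff = tf.constant([-200.0, 0.0, 3.0])

epsilon = 1e-8  # small positive constant, as suggested in the review
loglikelihood = tf.reduce_sum(tf.math.log(tf.sigmoid(utility_diff) + epsilon))
print(loglikelihood.numpy())  # finite: about log(1e-8) + log(0.5) + log(0.953)
```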
Coverage Report for Python 3.11
Coverage Report for Python 3.10
Coverage Report for Python 3.12