fix the logic of closing session #4370

Open
lvhan028 wants to merge 1 commit into InternLM:main from lvhan028:fix-chat-exit

@lvhan028
Collaborator

Thanks for your contribution; we appreciate it a lot. Following the instructions below will make your pull request healthier and help it receive feedback more easily. If you do not understand some items, don't worry: just open the pull request and ask the maintainers for help.

Motivation

Please describe the motivation of this PR and the goal you want to achieve through this PR.

Modification

Please briefly describe what modification is made in this PR.

BC-breaking (Optional)

Does the modification introduce changes that break the backward-compatibility of the downstream repositories?
If so, please describe how it breaks the compatibility and how the downstream projects should modify their code to keep compatibility with this PR.

Use cases (Optional)

If this PR introduces a new feature, it is better to list some use cases here, and update the documentation.

Checklist

  1. Pre-commit or other linting tools are used to fix the potential lint issues.
  2. The modification is covered by complete unit tests. If not, please add more unit tests to ensure the correctness.
  3. If the modification has a dependency on downstream projects of a newer version, this PR should be tested with all supported versions of downstream projects.
  4. The documentation has been modified accordingly, like docstring or example tutorials.

Copilot AI review requested due to automatic review settings February 25, 2026 07:17
Contributor

Copilot AI left a comment

Pull request overview

This PR addresses issues with session closing logic in the lmdeploy session manager and includes additional CMake build configuration improvements. The main focus is on fixing how sessions are closed to prevent premature cancellation and ensure proper resource cleanup when the main program exits.

Changes:

  • Enhanced the Session.close() method to wait for async operations to complete with a 5-second timeout
  • Added early exit optimization for sessions that haven't processed any requests yet
  • Improved error logging in async_close() to use logger.exception()
  • Wrapped test executables in BUILD_TEST conditionals in two CMakeLists.txt files
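The first item in the list above can be sketched as follows. This is a minimal illustration rather than the lmdeploy implementation: `BackgroundLoop`, `DemoSession`, and the `closed` flag are hypothetical names, and the sketch assumes the session's coroutines run on an event loop owned by a separate thread. Only the 5-second default timeout is taken from the overview.

```python
import asyncio
import concurrent.futures
import threading


class BackgroundLoop:
    """Hypothetical helper: runs an asyncio event loop in a daemon thread."""

    def __init__(self):
        self.loop = asyncio.new_event_loop()
        self._thread = threading.Thread(target=self.loop.run_forever, daemon=True)
        self._thread.start()

    def stop(self):
        """Cancel leftover tasks, then stop and close the loop."""

        async def _drain():
            current = asyncio.current_task()
            leftovers = [t for t in asyncio.all_tasks() if t is not current]
            for task in leftovers:
                task.cancel()
            await asyncio.gather(*leftovers, return_exceptions=True)

        asyncio.run_coroutine_threadsafe(_drain(), self.loop).result()
        self.loop.call_soon_threadsafe(self.loop.stop)
        self._thread.join()
        self.loop.close()


class DemoSession:
    """Hypothetical stand-in for a session with an async close path."""

    def __init__(self, bg, delay=0.05):
        self._bg = bg
        self._delay = delay
        self.closed = False

    async def async_close(self):
        await asyncio.sleep(self._delay)  # simulate releasing engine resources
        self.closed = True

    def close(self, timeout=5.0):
        """Block until async_close() finishes, or give up after `timeout` seconds."""
        future = asyncio.run_coroutine_threadsafe(self.async_close(), self._bg.loop)
        try:
            future.result(timeout=timeout)
            return True
        except concurrent.futures.TimeoutError:
            future.cancel()  # request cancellation instead of leaving it running
            return False


bg = BackgroundLoop()
fast = DemoSession(bg)
fast_ok = fast.close()              # finishes well within the timeout
slow = DemoSession(bg, delay=1.0)
slow_ok = slow.close(timeout=0.05)  # times out; the close is cancelled
bg.stop()
```

Returning a boolean from `close()` is a choice made for this sketch so that the caller can tell a completed close from a timed-out one; the PR may report the timeout differently.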

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 4 comments.

| File | Description |
| --- | --- |
| lmdeploy/serve/managers/session_manager.py | Main fix: Added synchronization to close() method to wait for async operations, early return for unused sessions, and improved error logging |
| src/turbomind/kernels/CMakeLists.txt | Wrapped test_quantization executable in BUILD_TEST conditional |
| src/turbomind/comm/gloo/CMakeLists.txt | Wrapped test_ipc_comm executable in BUILD_TEST conditional |
Comments suppressed due to low confidence (1)

lmdeploy/serve/managers/session_manager.py:128

  • The abort() method doesn't wait for the async operation to complete, unlike the new close() implementation. This inconsistency could lead to race conditions where abort() returns before the async_abort() coroutine finishes executing. While the async_abort() method has a comment indicating "DO NOT reset the session here because it might be used by other components," the lack of synchronization in abort() could still cause issues if the caller expects the abort operation to complete before proceeding.

Consider applying similar synchronization logic to abort() as was added to close(), or document why abort() doesn't need to wait for completion while close() does.

    def abort(self):
        """Abort the session in sync mode."""
        self._run(self.async_abort())
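The race described in this comment can be illustrated with a small sketch. It assumes, hypothetically, that `_run` only schedules the coroutine on a background event loop and returns; the PR excerpt does not show what `_run` actually does. `abort_no_wait`, `abort_and_wait`, and `state` are names invented for the illustration.

```python
import asyncio
import threading

# Hypothetical setup: an event loop running in a background thread.
loop = asyncio.new_event_loop()
thread = threading.Thread(target=loop.run_forever, daemon=True)
thread.start()

state = {'aborted': False}


async def async_abort():
    await asyncio.sleep(0.2)  # simulate the engine cancelling a request
    state['aborted'] = True


def abort_no_wait():
    """Fire-and-forget: schedules the coroutine and returns immediately."""
    return asyncio.run_coroutine_threadsafe(async_abort(), loop)


def abort_and_wait(timeout=5.0):
    """Blocks until the coroutine has finished or the timeout expires."""
    asyncio.run_coroutine_threadsafe(async_abort(), loop).result(timeout=timeout)


pending = abort_no_wait()
seen_right_after_return = state['aborted']  # coroutine is still sleeping here
pending.result(timeout=5.0)                 # drain the first call before continuing

state['aborted'] = False
abort_and_wait()
seen_after_waiting = state['aborted']       # abort completed before returning

loop.call_soon_threadsafe(loop.stop)
thread.join()
loop.close()
```

A caller that reads session state right after the fire-and-forget variant returns may observe the pre-abort state, which is the inconsistency the comment points at.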

Comment on lines +37 to +40
if (BUILD_TEST)
add_executable(test_quantization test_quantization.cc gemm/test/test_utils.cu)
target_link_libraries(test_quantization PRIVATE quantization_kernels core)
endif ()
Copilot AI Feb 25, 2026

The changes to this CMakeLists.txt file (wrapping test executable in BUILD_TEST conditional) are unrelated to fixing session closing logic mentioned in the PR title. While the changes themselves are correct and follow the established pattern in the codebase (see src/turbomind/core/CMakeLists.txt:24, src/turbomind/comm/CMakeLists.txt:28), they appear to be a separate refactoring that should ideally be in a different PR or the PR description should mention both types of changes.

Consider updating the PR description to explain that this PR includes both a fix for session closing logic and a refactoring to conditionally build test executables.

Comment on lines +38 to +41
if (BUILD_TEST)
add_executable(test_ipc_comm test_ipc_comm.cc)
target_link_libraries(test_ipc_comm PRIVATE gloo_comm Threads::Threads)
endif ()
Copilot AI Feb 25, 2026

The changes to this CMakeLists.txt file (wrapping test executable in BUILD_TEST conditional) are unrelated to fixing session closing logic mentioned in the PR title. While the changes themselves are correct and follow the established pattern in the codebase (see src/turbomind/core/CMakeLists.txt:24, src/turbomind/comm/CMakeLists.txt:28), they appear to be a separate refactoring that should ideally be in a different PR or the PR description should mention both types of changes.

Consider updating the PR description to explain that this PR includes both a fix for session closing logic and a refactoring to conditionally build test executables.

        """End the session."""
        logger.info(f'[session] Ending session {self.session_id}')
        if self._handle is None and self.step == 0:
            logger.info(f'[session] Closing session {self.session_id} before first request')
Copilot AI Feb 25, 2026

The early return in async_close() bypasses the reset() call at line 124, which can lead to incomplete session cleanup. When a session is closed before the first request, the session state (prompt, response, history, etc.) will not be reset, and importantly, _session_mgr will not be set to None. This could cause resource leaks if the session object is retained after closing.

Consider calling self.reset() before the early return to ensure consistent cleanup behavior regardless of when the session is closed.

Suggested change
    -            logger.info(f'[session] Closing session {self.session_id} before first request')
    +            logger.info(f'[session] Closing session {self.session_id} before first request')
    +            self.reset()
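The effect of the suggestion can be seen in a simplified sketch. `DemoSession` is a hypothetical stand-in, not the lmdeploy class; its `reset()` only clears the few fields needed for the illustration, and the `history` field and the `finally` clause are assumptions made here.

```python
import asyncio


class DemoSession:
    """Hypothetical, simplified session used only for this illustration."""

    def __init__(self, session_id, session_mgr):
        self.session_id = session_id
        self.step = 0
        self.history = ['stale entry']
        self._handle = None
        self._session_mgr = session_mgr

    def reset(self):
        """Clear per-session state and drop the back-reference to the manager."""
        self.step = 0
        self.history = []
        self._session_mgr = None

    async def async_close(self):
        if self._handle is None and self.step == 0:
            # Nothing was sent to the engine, so there is nothing to end,
            # but the local state still needs to be cleaned up.
            self.reset()
            return
        try:
            if self._handle is not None:
                await self._handle.async_end(self.session_id)
        finally:
            self.reset()


session = DemoSession(session_id=1, session_mgr=object())
asyncio.run(session.async_close())
```

With the `reset()` call in the early-return branch, a session closed before its first request ends up in the same state as one closed after serving requests.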

                await handle.async_end(self.session_id)
            except (Exception, asyncio.CancelledError, GeneratorExit) as e:
    -            logger.error(f'[async_end] exception caught: {e}')
    +            logger.exception(f'[async_end] exception caught: {type(e).__name__}: {e!r}')
Copilot AI Feb 25, 2026

The logger.exception() call automatically includes the exception type, message, and traceback. Adding the exception details manually in the message string creates redundant information in the logs. The pattern used elsewhere in the codebase (e.g., lmdeploy/metrics/metrics_processor.py:81, lmdeploy/pytorch/engine/engine.py:467) is to provide a descriptive message and let logger.exception() handle the exception details.

Consider simplifying to: logger.exception('[async_end] exception caught') to follow the codebase convention and avoid redundancy.
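The behaviour this comment relies on can be checked with the standard `logging` module alone. In the sketch below, the logger name `demo.async_end` and the raised `ValueError` are made up for the demonstration; the message string is the one proposed in the comment.

```python
import io
import logging

stream = io.StringIO()
logger = logging.getLogger('demo.async_end')  # hypothetical logger name
logger.setLevel(logging.ERROR)
logger.propagate = False  # keep the demo output out of the root logger
logger.addHandler(logging.StreamHandler(stream))

try:
    raise ValueError('engine went away')
except Exception:
    # The message carries no exception details; logging appends the
    # traceback, exception type, and message from the active exception.
    logger.exception('[async_end] exception caught')

output = stream.getvalue()
```

`logger.exception()` logs at ERROR level with `exc_info` set, so `output` contains the short message followed by the full traceback without any manual formatting of `e`.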

2 participants