Skip to content

Possible provider cleanup bug on Provider 0.12.0. #627

@Christoph203

Description

@Christoph203

Dashboard showed 0 active leases, but Kubernetes still had 3 tenant namespaces running GPU miner workloads. Provider inventory showed 3 of 4 GPUs allocated.

Provider logs repeatedly showed:

lease is out of funds
sending withdraw
payment closed
failed to do lease withdrawal

for lease IDs 27025137, 27025330 and 27025339.

The miner deployments continued running until I manually deleted them. They were NOT recreated by the provider, suggesting the leases were no longer actively managed.

After deleting the namespaces, all GPUs were immediately released.

Provider info:
chris@akash:~$ sudo kubectl describe pod -n akash-services akash-provider-0 | grep Image:
Image: ghcr.io/akash-network/provider:0.12.0 Image: ghcr.io/akash-network/provider:0.12.0

Provider-Wallet: akash1pvskd94jtkph9vgqpf6ecg2as8pvck9w5a7wga
domain: provider.cmolls.de

deployments before cleanup.log
ProviderLog before cleanup.log
pods before cleanup.log
Image

pods after cleanup.log
moreProviderLogs.log
HelmInfo.log

OutOfBalanceSearch.log

Search_JobID_27025137.log
Search_JobID_27025339.log
Search_JobID_27025330.log

Please note the Power-drop on June 5th which indicates one job was deleted. The power difference is exactly 1 RTX3090 needs.
Image

Metadata

Metadata

Assignees

No one assigned

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions