algorithms/leetcode/3793-FindUsersWithHighTokenUsage.sql at master · freewu/algorithms · GitHub

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
-- 3793. Find Users with High Token Usage
-- Table: prompts
-- +-------------+---------+
-- | Column Name | Type    |
-- +-------------+---------+
-- | user_id     | int     |
-- | prompt      | varchar |
-- | tokens      | int     |
-- +-------------+---------+
-- (user_id, prompt) is the primary key (unique value) for this table.
-- Each row represents a prompt submitted by a user to an AI system along with the number of tokens consumed.
-- Write a solution to analyze AI prompt usage patterns based on the following requirements:

-- For each user, calculate the total number of prompts they have submitted.
-- For each user, calculate the average tokens used per prompt (Rounded to 2 decimal places).
-- Only include users who have submitted at least 3 prompts.
-- Only include users who have submitted at least one prompt with tokens greater than their own average token usage.
-- Return the result table ordered by average tokens in descending order, and then by user_id in ascending order.

-- The result format is in the following example.

-- Example:
-- Input:
-- prompts table:
-- +---------+--------------------------+--------+
-- | user_id | prompt                   | tokens |
-- +---------+--------------------------+--------+
-- | 1       | Write a blog outline     | 120    |
-- | 1       | Generate SQL query       | 80     |
-- | 1       | Summarize an article     | 200    |
-- | 2       | Create resume bullet     | 60     |
-- | 2       | Improve LinkedIn bio     | 70     |
-- | 3       | Explain neural networks  | 300    |
-- | 3       | Generate interview Q&A   | 250    |
-- | 3       | Write cover letter       | 180    |
-- | 3       | Optimize Python code     | 220    |
-- +---------+--------------------------+--------+
-- Output:
-- +---------+---------------+------------+
-- | user_id | prompt_count  | avg_tokens |
-- +---------+---------------+------------+
-- | 3       | 4             | 237.5      |
-- | 1       | 3             | 133.33     |
-- +---------+---------------+------------+
-- Explanation:
-- User 1:
-- Total prompts = 3
-- Average tokens = (120 + 80 + 200) / 3 = 133.33
-- Has a prompt with 200 tokens, which is greater than the average
-- Included in the result
-- User 2:
-- Total prompts = 2 (less than the required minimum)
-- Excluded from the result
-- User 3:
-- Total prompts = 4
-- Average tokens = (300 + 250 + 180 + 220) / 4 = 237.5
-- Has prompts with 300 and 250 tokens, both greater than the average
-- Included in the result
-- The Results table is ordered by avg_tokens in descending order, then by user_id in ascending order
SELECT
    user_id,
    COUNT(*) AS prompt_count, --
    ROUND(AVG(tokens), 2) AS avg_tokens
FROM
    prompts
GROUP BY
    user_id
HAVING
    prompt_count >= 3 AND -- Only include users who have submitted at least 3 prompts.
    MAX(tokens) > avg_tokens -- Only include users who have submitted at least one prompt with tokens greater than their own average token usage.
ORDER BY
    avg_tokens DESC, user_id ASC -- The Results table is ordered by avg_tokens in descending order, then by user_id in ascending order