-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy path3793-FindUsersWithHighTokenUsage.sql
More file actions
72 lines (69 loc) · 3.05 KB
/
3793-FindUsersWithHighTokenUsage.sql
File metadata and controls
72 lines (69 loc) · 3.05 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
-- 3793. Find Users with High Token Usage
-- Table: prompts
-- +-------------+---------+
-- | Column Name | Type |
-- +-------------+---------+
-- | user_id | int |
-- | prompt | varchar |
-- | tokens | int |
-- +-------------+---------+
-- (user_id, prompt) is the primary key (unique value) for this table.
-- Each row represents a prompt submitted by a user to an AI system along with the number of tokens consumed.
-- Write a solution to analyze AI prompt usage patterns based on the following requirements:
-- For each user, calculate the total number of prompts they have submitted.
-- For each user, calculate the average tokens used per prompt (Rounded to 2 decimal places).
-- Only include users who have submitted at least 3 prompts.
-- Only include users who have submitted at least one prompt with tokens greater than their own average token usage.
-- Return the result table ordered by average tokens in descending order, and then by user_id in ascending order.
-- The result format is in the following example.
-- Example:
-- Input:
-- prompts table:
-- +---------+--------------------------+--------+
-- | user_id | prompt | tokens |
-- +---------+--------------------------+--------+
-- | 1 | Write a blog outline | 120 |
-- | 1 | Generate SQL query | 80 |
-- | 1 | Summarize an article | 200 |
-- | 2 | Create resume bullet | 60 |
-- | 2 | Improve LinkedIn bio | 70 |
-- | 3 | Explain neural networks | 300 |
-- | 3 | Generate interview Q&A | 250 |
-- | 3 | Write cover letter | 180 |
-- | 3 | Optimize Python code | 220 |
-- +---------+--------------------------+--------+
-- Output:
-- +---------+---------------+------------+
-- | user_id | prompt_count | avg_tokens |
-- +---------+---------------+------------+
-- | 3 | 4 | 237.5 |
-- | 1 | 3 | 133.33 |
-- +---------+---------------+------------+
-- Explanation:
-- User 1:
-- Total prompts = 3
-- Average tokens = (120 + 80 + 200) / 3 = 133.33
-- Has a prompt with 200 tokens, which is greater than the average
-- Included in the result
-- User 2:
-- Total prompts = 2 (less than the required minimum)
-- Excluded from the result
-- User 3:
-- Total prompts = 4
-- Average tokens = (300 + 250 + 180 + 220) / 4 = 237.5
-- Has prompts with 300 and 250 tokens, both greater than the average
-- Included in the result
-- The Results table is ordered by avg_tokens in descending order, then by user_id in ascending order
SELECT
user_id,
COUNT(*) AS prompt_count, --
ROUND(AVG(tokens), 2) AS avg_tokens
FROM
prompts
GROUP BY
user_id
HAVING
prompt_count >= 3 AND -- Only include users who have submitted at least 3 prompts.
MAX(tokens) > avg_tokens -- Only include users who have submitted at least one prompt with tokens greater than their own average token usage.
ORDER BY
avg_tokens DESC, user_id ASC -- The Results table is ordered by avg_tokens in descending order, then by user_id in ascending order