AntiScam Checks #2

zrodevkaan · 2025-12-07T20:14:58Z

This is a low-level check.
Please test it on your local machine to double-check.

ephemeral doesnt work on regular messages

zerebos

This is missing some necessary sanity checks. Take a look at the detectspam event: this should check whether the message is in a guild, whether the author is a bot, channel permissions, etc. It is also not toggleable; it feels like it should be collapsed into detectspam.
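The guard clauses requested above could look roughly like the sketch below. `MessageLike`, its fields, and `shouldScan` are hypothetical names used only for illustration; a real handler would read the equivalent values from its Discord library's message object.

```typescript
// Hypothetical, minimal message shape (an assumption, not the bot's real type).
interface MessageLike {
    inGuild: boolean;               // false for direct messages
    authorIsBot: boolean;           // true when another bot sent the message
    botCanDeleteInChannel: boolean; // result of a channel permission lookup
}

// Returns true only when every sanity check passes and the feature is enabled.
function shouldScan(message: MessageLike, featureEnabled: boolean): boolean {
    if (!featureEnabled) return false;                 // toggleable, as requested in review
    if (!message.inGuild) return false;                // skip DMs
    if (message.authorIsBot) return false;             // never act on bot messages
    if (!message.botCanDeleteInChannel) return false;  // cannot moderate here
    return true;
}
```

Keeping the checks in one early-return function makes it easy to share them if the scam check is folded into detectspam.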

Also code-wise there are other issues but I can fix after merging

Rework & Merged Into detectspam

Copilot

Pull request overview

This PR refactors the spam detection system by consolidating multiple regex patterns and their validation logic into a unified data-driven approach using a pattern configuration array.

Changes:

Refactored spam detection from separate regex variables to a centralized phishingPatterns array with configurable predicates
Updated embed logging to support multiple reasons instead of a single reason
Removed error handling for message deletion (commented out old code but removed try-catch from new implementation)
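As a rough illustration of the data-driven approach summarized above, the sketch below evaluates a pattern array and collects every matching reason. The field names follow the diff in this PR, but `PhishingPattern`, `examplePatterns`, `collectReasons`, and the example domain are assumptions made for this sketch, not the PR's actual code.

```typescript
// Hypothetical shape for one entry in the pattern array.
interface PhishingPattern {
    regex: RegExp;
    whitelist: string[];
    predicate: (links: RegExpMatchArray[], self: PhishingPattern) => boolean;
    reason: string;
    maxCount?: number;
}

// Illustrative entry only; it is not one of the PR's real patterns.
const examplePatterns: PhishingPattern[] = [
    {
        regex: /\bfree-nitro\.example\b/ig,
        whitelist: [],
        predicate: links => links.length > 0,
        reason: "Example Scam Domain",
    },
];

// Run every pattern against the message text and collect all matching reasons.
function collectReasons(content: string, patterns: PhishingPattern[]): string[] {
    const reasons: string[] = [];
    for (const pattern of patterns) {
        // matchAll requires the global flag and iterates over a copy of the regex,
        // so lastIndex state does not leak from one message to the next.
        const links = Array.from(content.matchAll(pattern.regex));
        if (links.length > 0 && pattern.predicate(links, pattern)) {
            reasons.push(pattern.reason);
        }
    }
    return reasons;
}
```

Returning an array of reasons rather than a single string is what lets the embed log list several triggers for one message.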

src/events/detectspam.ts

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 10 comments.

Copilot · 2026-01-19T05:51:00Z

src/events/detectspam.ts

+            const hosts = links.map(match => {
+                const url = match[0];
+                const fullUrl = url.startsWith('http') ? url : `https://${url}`;
+                return URL.parse(fullUrl)?.host;
+            }).filter(Boolean);


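Check error handling for URL parsing. `new URL()` throws on a malformed URL, while the static `URL.parse()` returns `null` (which `?.host` and `.filter(Boolean)` already absorb, assuming `URL` is the global WHATWG class). If parsing moves to the constructor, wrap it in a try-catch so a single malformed URL cannot abort spam detection.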
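A minimal sketch of defensive host extraction, assuming the links arrive as plain strings. `safeHostname` and `extractHostnames` are hypothetical helper names; the scheme test is slightly stricter than the `startsWith('http')` check in the diff.

```typescript
// Parse one raw link and return its hostname, or null if it cannot be parsed.
function safeHostname(raw: string): string | null {
    // Prepend a scheme only when the link does not already start with http:// or https://.
    const fullUrl = /^https?:\/\//i.test(raw) ? raw : `https://${raw}`;
    try {
        // new URL() throws a TypeError on malformed input, hence the try-catch.
        return new URL(fullUrl).hostname;
    } catch {
        return null;
    }
}

// Map raw links to hostnames, silently dropping anything unparseable.
function extractHostnames(rawLinks: string[]): string[] {
    return rawLinks
        .map(safeHostname)
        .filter((host): host is string => host !== null);
}
```

With this shape, a malformed link is skipped and the remaining links are still checked.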

Copilot · 2026-01-19T05:51:01Z

src/events/detectspam.ts

+    {
+        regex: /(?:http[s]?:\/\/.)?(?:www\.)?[-a-zA-Z0-9@%._+~#=]{2,256}\.[a-z]{2,6}\b[-a-zA-Z0-9@:%_+.~#?&\/=]*/ig,
+        whitelist: [],
+        predicate: (links, self) => links.length == self.maxCount, // this should probably be more than 4 later on.


The equality check should use strict equality. Replace '==' with '===' to avoid type coercion issues and follow JavaScript best practices.

Suggested change

-        predicate: (links, self) => links.length == self.maxCount, // this should probably be more than 4 later on.
+        predicate: (links, self) => links.length === self.maxCount, // this should probably be more than 4 later on.

Copilot · 2026-01-19T05:51:01Z

src/events/detectspam.ts

+                const fullUrl = url.startsWith('http') ? url : `https://${url}`;
+                return URL.parse(fullUrl)?.host;
+            }).filter(Boolean);
+            return hosts.some(host => !self.whitelist.includes(host));


The whitelist check should compare against `hostname` rather than `host`. `host` includes the port number when one is specified, so a legitimate URL with an explicit port would not match its whitelist entry and would be flagged. Compare against `hostname` instead, or normalize both sides before comparing.
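The difference between the two properties can be seen in a short sketch. The port `8443` and the `isWhitelisted` helper are made up for illustration.

```typescript
// A legitimate domain with an explicit, non-default port.
const withPort = new URL("https://discord.com:8443/invite");

const hostValue = withPort.host;         // "discord.com:8443" (port included)
const hostnameValue = withPort.hostname; // "discord.com" (port dropped)

// Hypothetical helper: exact-match whitelist lookup on the hostname only.
function isWhitelisted(url: URL, whitelist: string[]): boolean {
    return whitelist.includes(url.hostname);
}

const discordWhitelist = ["discord.com", "discordapp.com"];
const flaggedByHost = !discordWhitelist.includes(hostValue);       // true: false positive
const flaggedByHostname = !isWhitelisted(withPort, discordWhitelist); // false: correctly allowed
```

Default ports such as 443 for https are already omitted from `host`, so the mismatch only appears with non-default ports.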

Copilot · 2026-01-19T05:51:01Z

src/events/detectspam.ts

+const phishingPatterns = [
+    {
+        regex: /([a-zA-Z-\\.]+)?d[il][il]?scorr?(cl|[ldb])([a-zA-Z-\\.]+)?\.(com|net|app|gift|ru|uk)/ig,
+        whitelist: ['discord.com', 'discordapp.com'],


The pattern will incorrectly match 'betterdiscord.app' which was explicitly whitelisted in the old code. Add 'betterdiscord.app' to the whitelist array to maintain the same behavior as the previous implementation.

Suggested change

-        whitelist: ['discord.com', 'discordapp.com'],
+        whitelist: ['discord.com', 'discordapp.com', 'betterdiscord.app'],

Copilot · 2026-01-19T05:51:01Z

src/events/detectspam.ts

+        whitelist: [],
+        predicate: (links, self) => links.length == self.maxCount, // this should probably be more than 4 later on.


This generic URL regex pattern is very broad and matches most legitimate URLs as well. With an empty whitelist, the predicate fires on any message whose link count equals maxCount (4), regardless of where the links point. Consider adding common legitimate domains to the whitelist or refining the pattern to avoid false positives.

Suggested change

-        whitelist: [],
-        predicate: (links, self) => links.length == self.maxCount, // this should probably be more than 4 later on.
+        whitelist: [
+            'discord.com',
+            'discordapp.com',
+            'steamcommunity.com',
+            'github.com',
+            'gitlab.com',
+            'bitbucket.org',
+            'google.com',
+            'youtube.com',
+            'youtu.be',
+            'twitch.tv',
+            'twitter.com',
+            'x.com',
+            'reddit.com'
+        ],
+        predicate: (links, self) => {
+            const suspiciousLinks = links.filter(match => {
+                const url = match[0].toLowerCase();
+                return !self.whitelist.some(domain => url.includes(domain));
+            });
+            return suspiciousLinks.length === self.maxCount; // this should probably be more than 4 later on.
+        },
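One caveat with the substring test in this suggestion: `url.includes(domain)` also accepts lookalike hosts that merely contain a whitelisted name. A possible alternative, sketched under the assumption that subdomains of whitelisted domains should be allowed (the PR's other predicates use exact matching), compares parsed hostnames instead. `isWhitelistedHost` is a hypothetical helper.

```typescript
// Accept a hostname only if it equals a whitelisted domain or is a subdomain of one.
function isWhitelistedHost(hostname: string, whitelist: string[]): boolean {
    const host = hostname.toLowerCase();
    return whitelist.some(domain => host === domain || host.endsWith(`.${domain}`));
}

const trustedDomains = ["discord.com", "discordapp.com"];

// A lookalike host that embeds a trusted name as a prefix.
const lookalike = "discord.com.evil.example";
const passesSubstringTest = trustedDomains.some(domain => lookalike.includes(domain)); // true
const passesHostnameTest = isWhitelistedHost(lookalike, trustedDomains);               // false
```

Matching on the domain suffix with a leading dot keeps `cdn.discordapp.com` allowed while rejecting hosts such as `notdiscord.com`.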

Copilot · 2026-01-19T05:51:02Z

src/events/detectspam.ts

+const phishingPatterns = [
+    {
+        regex: /([a-zA-Z-\\.]+)?d[il][il]?scorr?(cl|[ldb])([a-zA-Z-\\.]+)?\.(com|net|app|gift|ru|uk)/ig,
+        whitelist: ['discord.com', 'discordapp.com'],
+        predicate: (links, self) => {
+            const hosts = links.map(match => {
+                const url = match[0];
+                const fullUrl = url.startsWith('http') ? url : `https://${url}`;
+                return URL.parse(fullUrl)?.host;
+            }).filter(Boolean);
+            return hosts.some(host => !self.whitelist.includes(host));
+        },
+        reason: 'Fake Discord Domain'
+    },
+    {
+        regex: /str?e[ea]?mcomm?m?un[un]?[un]?[tl]?[il][tl]?ty\.(com|net|ru|us)/ig,
+        whitelist: ['steamcommunity.com'],
+        predicate: (links, self) => {
+            const hosts = links.map(match => {
+                const url = match[0];
+                const fullUrl = url.startsWith('http') ? url : `https://${url}`;
+                return URL.parse(fullUrl)?.host;
+            }).filter(Boolean);
+
+            return hosts.some(host => !self.whitelist.includes(host));
+        },


This predicate function is duplicated in the second pattern (lines 30-37). Consider extracting this into a shared helper function to reduce code duplication and improve maintainability.

Suggested change

-const phishingPatterns = [
-    {
-        regex: /([a-zA-Z-\\.]+)?d[il][il]?scorr?(cl|[ldb])([a-zA-Z-\\.]+)?\.(com|net|app|gift|ru|uk)/ig,
-        whitelist: ['discord.com', 'discordapp.com'],
-        predicate: (links, self) => {
-            const hosts = links.map(match => {
-                const url = match[0];
-                const fullUrl = url.startsWith('http') ? url : `https://${url}`;
-                return URL.parse(fullUrl)?.host;
-            }).filter(Boolean);
-            return hosts.some(host => !self.whitelist.includes(host));
-        },
-        reason: 'Fake Discord Domain'
-    },
-    {
-        regex: /str?e[ea]?mcomm?m?un[un]?[un]?[tl]?[il][tl]?ty\.(com|net|ru|us)/ig,
-        whitelist: ['steamcommunity.com'],
-        predicate: (links, self) => {
-            const hosts = links.map(match => {
-                const url = match[0];
-                const fullUrl = url.startsWith('http') ? url : `https://${url}`;
-                return URL.parse(fullUrl)?.host;
-            }).filter(Boolean);
-            return hosts.some(host => !self.whitelist.includes(host));
-        },
+function hasNonWhitelistedHost(links: RegExpMatchArray[], whitelist: string[]): boolean {
+    const hosts = links.map(match => {
+        const url = match[0];
+        const fullUrl = url.startsWith('http') ? url : `https://${url}`;
+        return URL.parse(fullUrl)?.host;
+    }).filter(Boolean);
+    return hosts.some(host => !whitelist.includes(host as string));
+}
+const phishingPatterns = [
+    {
+        regex: /([a-zA-Z-\\.]+)?d[il][il]?scorr?(cl|[ldb])([a-zA-Z-\\.]+)?\.(com|net|app|gift|ru|uk)/ig,
+        whitelist: ['discord.com', 'discordapp.com'],
+        predicate: (links, self) => hasNonWhitelistedHost(links, self.whitelist),
+        reason: 'Fake Discord Domain'
+    },
+    {
+        regex: /str?e[ea]?mcomm?m?un[un]?[un]?[tl]?[il][tl]?ty\.(com|net|ru|us)/ig,
+        whitelist: ['steamcommunity.com'],
+        predicate: (links, self) => hasNonWhitelistedHost(links, self.whitelist),

Copilot · 2026-01-19T05:51:02Z

src/events/detectspam.ts

+        whitelist: [],
+        predicate: (links, self) => links.length == self.maxCount, // this should probably be more than 4 later on.
+        reason: 'Potential Scam Message',
+        maxCount: 4


The magic number 4 is used for maxCount without explanation. Consider making this a named constant at the top of the file with a descriptive name like MAX_URLS_BEFORE_SPAM_CHECK or adding a comment explaining why 4 URLs is the threshold.
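A small sketch of the named-constant idea. `MAX_LINKS_PER_MESSAGE` and `hitsLinkThreshold` are hypothetical names, and the use of `>=` instead of the diff's equality check is an assumption about the intended behavior (flagging messages at or above the threshold, not only those with exactly four links).

```typescript
// Hypothetical constant: number of links in one message at which it is treated as suspicious.
const MAX_LINKS_PER_MESSAGE = 4;

// True when the link count reaches the threshold or goes beyond it.
function hitsLinkThreshold(linkCount: number, maxCount: number = MAX_LINKS_PER_MESSAGE): boolean {
    return linkCount >= maxCount;
}
```

With an equality check, a message containing five links would pass unflagged, which is why the comparison operator matters as much as the constant's name.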

Copilot · 2026-01-19T05:51:02Z

src/events/detectspam.ts

+            const hosts = links.map(match => {
+                const url = match[0];
+                const fullUrl = url.startsWith('http') ? url : `https://${url}`;
+                return URL.parse(fullUrl)?.host;


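This call may not be deprecated. If `URL` here is the global WHATWG class, `URL.parse()` is its static method, which returns `null` on invalid input; the API that Node.js deprecated is the legacy `url.parse()` from the `node:url` module. The static method is only available in newer runtimes (Node.js 22.1 and later), so `new URL(fullUrl).hostname` inside a try-catch is the more portable choice, and `hostname` also drops the port.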

Copilot · 2026-01-19T05:51:02Z

src/events/detectspam.ts

+            const hosts = links.map(match => {
+                const url = match[0];
+                const fullUrl = url.startsWith('http') ? url : `https://${url}`;
+                return URL.parse(fullUrl)?.host;


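This call may not be deprecated. If `URL` here is the global WHATWG class, `URL.parse()` is its static method, which returns `null` on invalid input; the API that Node.js deprecated is the legacy `url.parse()` from the `node:url` module. The static method is only available in newer runtimes (Node.js 22.1 and later), so `new URL(fullUrl).hostname` inside a try-catch is the more portable choice, and `hostname` also drops the port.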

Copilot · 2026-01-19T05:51:02Z

src/events/detectspam.ts

+            const hosts = links.map(match => {
+                const url = match[0];
+                const fullUrl = url.startsWith('http') ? url : `https://${url}`;
+                return URL.parse(fullUrl)?.host;
+            }).filter(Boolean);


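Check error handling for URL parsing. `new URL()` throws on a malformed URL, while the static `URL.parse()` returns `null` (which `?.host` and `.filter(Boolean)` already absorb, assuming `URL` is the global WHATWG class). If parsing moves to the constructor, wrap it in a try-catch so a single malformed URL cannot abort spam detection.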

zrodevkaan added 5 commits December 7, 2025 14:13

Antiscam Checks

62825bd

Error handling

541bdb0

Guh

62b594a

ephemeral doesnt work on regular messages

Remove log

75201e6

Fix lint

0864ae0

zerebos requested changes Dec 10, 2025

Revamp

2c69b94

Rework & Merged Into detectspam

zerebos requested a review from Copilot January 19, 2026 03:59

Copilot started reviewing on behalf of zerebos January 19, 2026 04:00

Copilot AI reviewed Jan 19, 2026

Fixed

2a17bcb

zrodevkaan requested a review from zerebos January 19, 2026 04:28

Whoops

dfad828

zerebos requested a review from Copilot January 19, 2026 05:48

Copilot started reviewing on behalf of zerebos January 19, 2026 05:48

Copilot AI reviewed Jan 19, 2026
