Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

chore: Normalize User.tms #10992

Draft
wants to merge 19 commits into
base: master
Choose a base branch
from
Draft

Conversation

Dschoordsch
Copy link
Contributor

@Dschoordsch Dschoordsch commented Mar 12, 2025

Description

Partially Fixes #9932
The tms array was removed from User in postgres as it is redundant to the information we can obtain by querying TeamMember. Because we often need this array to check permissions, I wanted to avoid the additional roundtrip to the db necessary to construct the array from TeamMember. Instead the tms array is queried already in the dataloader.

Demo

Nothing should change for the user.

Testing scenarios

Joining, leaving and archiving teams

Final checklist

  • I checked the code review guidelines
  • I have added Metrics Representative as reviewer(s) if my PR invovles metrics/data/analytics related changes
  • I have performed a self-review of my code, the same way I'd do it for any other team member
  • I have tested all cases I listed in the testing scenarios and I haven't found any issues or regressions
  • Whenever I took a non-obvious choice I added a comment explaining why I did it this way
  • I added the label Skip Maintainer Review Indicating the PR only requires reviewer review and can be merged right after it's approved if the PR introduces only minor changes, does not contain any architectural changes or does not introduce any new patterns and I think one review is sufficient'
  • PR title is human readable and could be used in changelog

Copy link
Contributor Author

@Dschoordsch Dschoordsch left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not sure solving this in code is the right call for the tms field as it's used in the connection logic which should be kept lean.
I'm thinking about making a UserWithTeams view for this use case.

@Dschoordsch
Copy link
Contributor Author

Views are clunky when altering the underlying table. We can achieve the same with just a custom loader.

@Dschoordsch Dschoordsch force-pushed the chore/9932/normalizeUserTms branch from 9035ff1 to c44c37b Compare March 13, 2025 12:38
@Dschoordsch Dschoordsch force-pushed the chore/9932/normalizeUserTms branch from b1f681c to 3099ba3 Compare March 18, 2025 10:07
@Dschoordsch Dschoordsch marked this pull request as ready for review March 18, 2025 15:41
@github-actions github-actions bot requested a review from tianrunhe March 18, 2025 15:42
export async function down(db: Kysely<any>): Promise<void> {
await db.schema
.alterTable('User')
.addColumn('tms', sql`character varying(100)[]`, (col) => col.defaultTo('{}').notNull())
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

varchar(100)[] means that the textual representation of the ENTIRE array must be < 100 chars, i.e. this will break if you join 10 or 12 teams.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Isn't that the same what we currently have or am I missing something?

tms character varying(100)[] DEFAULT '{}'::character varying[] NOT NULL,

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🤔 THAT IS INTERESTING!
Either my understanding is wrong or we've got a problem! will investigate...

eb
.selectFrom('TeamMember')
.select((se) => se.fn('array_agg', ['teamId']).as('tms'))
.where('TeamMember.userId', '=', eb.ref('User.id')),
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

should ignore inactive teams and teamMember objects that have been removed from the team

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Makes sense, will fix.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That archived team really got me. It's not enough to check the TeamMember but we also need to check the Team the team member points to :(

await db
.updateTable('User')
.set((eb) => ({
tms: eb.fn.coalesce(
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

question-- i don't understand how this PR works 😅 we're setting the tms in a migration, but have removed all the places where we update the tms column?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You're looking at the down migration. We're dropping tms in up, that's why the update code gets deleted.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ah! dumb me. ok

@@ -57,6 +59,7 @@ const removeFromOrg = async (
dataLoader.get('users').loadNonNull(userId)
])
dataLoader.clearAll('organizationUsers')
const {tms} = user
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

running yarn pg:generate should remove tms from the schema & will make finding things like this easier, at least i hope!

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've updated the description. Although I'm removing tms from the User table, I'm still querying it in the dataloader. I guess a different solution could also have been to add a trigger in PG to just get rid of the update logic in code. For performance reasons, especially for socket connects/disconnects I did not want to double the roundtrips to the db.

@Dschoordsch Dschoordsch marked this pull request as draft March 19, 2025 10:30
@tianrunhe tianrunhe removed their request for review March 19, 2025 17:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

chore: normalize data in PG across tables
2 participants