Thread: Re: revamp row-security tracking
On Thu, 21 Nov 2024 at 18:00, Nathan Bossart <nathandbossart@gmail.com> wrote: > > In light of CVE-2024-10976, which was fixed by commit cd7ab57, I'd like to > propose a bigger change to this area of the code that aims to future-proof > it a bit. Instead of requiring hackers to carefully cart around whether a > query references a table with RLS enabled, I think we should instead > accumulate such information globally and require higher-level routines like > fireRIRrules() and inline_set_returning_function() to inspect it as needed. > > The attached patch accomplishes this by establishing a global queue of > row-security "nest levels" that the aforementioned higher-level callers can > use. I'm not convinced that this is an improvement. The code in check_sql_fn_retval() is building a Query struct from scratch, so it seems perfectly natural for it to be responsible for setting all the required fields, based on the information it has available. With this patch, check_sql_fn_retval() is returning a potentially incorrectly marked Query at the end of the querytree list, which the caller is responsible for fixing up, which doesn't seem ideal. I'm also not a fan of using global variables in this way, or the resulting need to hook into the transaction management system to tidy up. The end result is that the places where the flag is set are moved further away from where RLS policies are applied, which IMO makes the code much harder to follow. There is exactly one place where RLS policies are applied, and it seems much more natural for it to have responsibility for setting this flag. I think that a slightly neater way for it to handle that would be to modify fireRIRrules(), adding an extra parameter "bool *hasRowSecurity" that it would set to true if RLS is enabled for the query it is rewriting. Doing that forces all callers to think about whether or not that affects some outer query. For example, ApplyRetrieveRule() would then do: rule_action = fireRIRrules(rule_action, activeRIRs, &parsetree->hasRowSecurity); rather than having a separate second step to update the flag on "parsetree", and similarly for fireRIRrules()'s recursive calls to itself. If, in the future, it becomes necessary to invoke fireRIRrules() on more parts of a Query, it's then much more likely that the new code won't forget to update the parent query's flag. Regards, Dean
Given there doesn't seem to be a huge amount of interest in this, I plan to mark it as Withdrawn soon. -- nathan
Nathan Bossart <nathandbossart@gmail.com> writes: > Given there doesn't seem to be a huge amount of interest in this, I plan to > mark it as Withdrawn soon. I think you're being too impatient. It's still an interesting topic, it just needs more thought to get to something committable. I find this has-row-security marking problem to be comparable to the has-sublinks marking problem. We've had tons of bugs-of-omission with that too, and the present code feels ugly and not any less prone to omissions than it ever was. I wonder whether considering both problems together would yield any insights, following Polya's dictum that "the more general problem may be easier to solve". One straightforward idea is to just not do the marking at all, but rather require places that want to know these properties to do a fresh search of the query tree when they want to know it. That obviously has performance questions to answer, but it's easier to give answers to performance questions than "is this correct" questions. regards, tom lane
On Mon, Feb 17, 2025 at 01:08:29PM -0500, Tom Lane wrote: > I think you're being too impatient. It's still an interesting > topic, it just needs more thought to get to something committable. Maybe I am. Thanks for chiming in. > I find this has-row-security marking problem to be comparable > to the has-sublinks marking problem. We've had tons of > bugs-of-omission with that too, and the present code feels > ugly and not any less prone to omissions than it ever was. > > I wonder whether considering both problems together would yield any > insights, following Polya's dictum that "the more general problem may > be easier to solve". > > One straightforward idea is to just not do the marking at all, > but rather require places that want to know these properties > to do a fresh search of the query tree when they want to know > it. That obviously has performance questions to answer, but > it's easier to give answers to performance questions than > "is this correct" questions. That could be worth a try. The reason I started with the global queue idea was that we seem to reliably discover relations with RLS enabled, we just tend to miss propagating that information to the top-level query. We could invent a separate query tree walker for discovering RLS, etc. to keep it centralized. However, besides the performance questions, it would be another separate piece of code to keep updated. Perhaps another variation on this idea is to create a query walker that just looks for hasRowSecurity flags throughout the tree (and to use that to mark the plan cache entry appropriately). -- nathan
Nathan Bossart <nathandbossart@gmail.com> writes: > Perhaps another variation > on this idea is to create a query walker that just looks for hasRowSecurity > flags throughout the tree (and to use that to mark the plan cache entry > appropriately). That seems like a pretty plausible compromise position. So we'd redefine Query.hasRowSecurity as summarizing the situation for only the Query's own rtable entries, not recursively for sub-Queries. regards, tom lane