Social Choice Methods for Database Aggregation
Pith reviewed 2026-05-24 18:12 UTC · model grok-4.3
The pith
Social choice aggregators can merge first-order databases while preserving integrity constraints and commuting with certain queries.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We identify classes of aggregators that respect integrity constraints in the aggregated database, provided these are satisfied in all individual databases. We also characterise languages for first-order queries on which the answer to a query on the aggregated database coincides with the aggregation of the answers to the query obtained on each individual database. This is a first step on the application of techniques from social choice theory to knowledge representation in databases.
What carries the argument
Aggregators from social choice theory applied directly to first-order relational database structures.
If this is right
- If the aggregator respects constraints in inputs, the output database satisfies them too.
- For characterized query languages, aggregation and querying commute.
- Multiple sources can contribute data without violating shared rules.
- This allows consistent integration of distributed or conflicting knowledge bases modeled as databases.
Where Pith is reading between the lines
- Such methods might handle cases where not all sources satisfy constraints by using different aggregator properties.
- Applications could include merging ontologies or RDF data from web sources.
- Implementation in query engines could test the commutation property on real datasets.
Load-bearing premise
That databases are faithfully modeled as first-order relational structures and that the chosen aggregators interact with integrity constraints and queries through required closure properties.
What would settle it
Finding an integrity constraint and aggregator in the identified class where all input databases satisfy the constraint but the output does not, or a query in the characterized language where the two ways of computing the answer differ.
read the original abstract
Knowledge can be represented compactly in multiple ways, from a set of propositional formulas, to a Kripke model, to a database. In this paper we study the aggregation of information coming from multiple sources, each source submitting a database modelled as a first-order relational structure. In the presence of integrity constraints, we identify classes of aggregators that respect them in the aggregated database, provided these are satisfied in all individual databases. We also characterise languages for first-order queries on which the answer to a query on the aggregated database coincides with the aggregation of the answers to the query obtained on each individual database. This contribution is meant to be a first step on the application of techniques from social choice theory to knowledge representation in databases.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper applies social choice theory to aggregate databases modeled as first-order relational structures. It identifies classes of aggregators that respect integrity constraints in the aggregated database whenever all input databases satisfy them, and characterizes first-order query languages such that the answer to a query on the aggregated database equals the aggregation of the answers to that query on each individual database.
Significance. If the characterizations hold, the work establishes a clean theoretical bridge between social choice and database theory for multi-source aggregation under constraints and for commuting aggregation with querying. It builds directly on standard relational-structure modeling and first-order logic without introducing free parameters, ad-hoc axioms, or domain-size restrictions, and the results are framed as a first step toward practical knowledge-representation applications.
minor comments (2)
- The abstract and introduction would benefit from a brief concrete example (e.g., a small relational schema with an integrity constraint and two source databases) to illustrate the aggregator classes and the query-commutation property before the general statements.
- Notation for the aggregator operations and the relational structures could be introduced with an explicit table or diagram in the preliminaries section to improve readability for readers coming from only one of the two fields.
Simulated Author's Rebuttal
We thank the referee for the positive summary of the paper, the assessment of its significance, and the recommendation of minor revision. No specific major comments appear in the report.
Circularity Check
No significant circularity; characterizations are self-contained
full rationale
The paper's central results are characterizations of aggregator classes preserving integrity constraints (when all inputs satisfy them) and FO query languages where query(aggregate(DBs)) equals aggregate(query(DBs)). These are proved from standard definitions of relational structures, social choice aggregators, and first-order logic closure properties. No equations reduce results to fitted inputs or self-definitions; no load-bearing self-citations or imported uniqueness theorems appear; the modeling choice is external to the field and the theorems are the target rather than premises. The derivation chain is therefore independent of its own outputs.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Databases are modeled as first-order relational structures
- domain assumption Integrity constraints hold in every individual database
Reference graph
Works this paper leans on
-
[1]
S. Abiteboul, R. Hull & V . Vianu (1995): Foundations of Databases. Addison-Wesley
work page 1995
-
[2]
O. Arieli, M. Denecker & M. Bruynooghe (2007): Distance semantics for database repair. Annals of Math- ematics and Artificial Intelligence 50(3), pp. 389–415, doi:10.1007/s10472-007-9074-1
-
[3]
K. J. Arrow (1963): Social Choice and Individual Values, 2nd edition. John Wiley and Sons
work page 1963
-
[4]
C. Baral, S. Kraus & J. Minker (1991): Combining Multiple Knowledge Bases. IEEE Transactions on Knowl- edge and Data Engineering 3(2), pp. 208–220, doi:10.1109/69.88001
-
[5]
C. Baral, S. Kraus, J. Minker & V . S. Subrahmanian (1992): Combining knowledge bases consisting of first-order theories. Computational Intelligence 8(1), doi:10.1016/0004-3702(80)90014-4. 66 Social Choice Methods for Database Aggregation
-
[6]
F. Belardinelli & U. Grandi (2019): A Social Choice Theoretic Perspective on Database Aggregation (Ex- tended Abstract). In: Proceedings of the 18th International Conference on Autonomous Agents and Multia- gent Systems (AAMAS)
work page 2019
-
[7]
H. A. Blair & V . S. Subrahmanian (1989): Paraconsistent Logic Programming. Theoretical Computer Sci- ence 68(2), pp. 135–154, doi:10.1016/0304-3975(89)90126-6
- [8]
-
[9]
M. Caminada & G. Pigozzi (2011): On judgment aggregation in abstract argumentation . Autonomous Agents and Multiagent Systems 22(1), pp. 64–102, doi:10.1007/s10458-009-9116-7
-
[10]
Ashok K. Chandra & Philip M. Merlin (1977): Optimal Implementation of Conjunctive Queries in Relational Data Bases. In: Proceedings of the Ninth Annual ACM Symposium on Theory of Computing , STOC ’77, ACM, New York, NY , USA, pp. 77–90, doi:10.1145/800105.803397
-
[11]
W. Chen & U. Endriss (2017): Preservation of Semantic Properties during the Aggregation of Abstract Argumentation Frameworks. In: Proceedings of the 16th Conference on Theoretical Aspects of Rationality and Knowledge (TARK), doi:10.1007/978-3-540-77684-0 4
-
[12]
F. Dietrich & C. List (2007): Arrow’s Theorem in Judgment Aggregation. Social Choice and Welfare 29(1), pp. 19–33, doi:10.1007/s00355-006-0196-x
-
[13]
F. Dietrich & C. List (2007): Judgment Aggregation by Quota Rules: Majority Voting Generalized. Journal of Theoretical Politics 19(4), pp. 391–424, doi:10.1016/0022-0531(75)90062-9
-
[14]
E. Dokow & R. Holzman (2010): Aggregation of binary evaluations. Journal of Economic Theory 145(2), pp. 495–511, doi:10.1016/j.jet.2007.10.004
-
[15]
J. Doyle & M. P. Wellman (1991): Impediments to Universal Preference-based Default Theories. Artificial Intelligence 49(1), pp. 97–128, doi:10.1016/0004-3702(91)90007-7
-
[16]
Endriss (2016): Judgment Aggregation
U. Endriss (2016): Judgment Aggregation . In F. Brandt, V . Conitzer, U. Endriss, J. Lang & A. D. Procaccia, editors: Handbook of Computational Social Choice , Cambridge University Press, doi:10.1017/CBO9781107446984.018
-
[17]
U. Endriss & U. Grandi (2014): Binary Aggregation by Selection of the Most Representative Voter . In: Proceedings of the 28th AAAI Conference on Artificial Intelligence (AAAI-2014)
work page 2014
-
[18]
U. Endriss & U. Grandi (2017): Graph Aggregation . Artificial Intelligence 245, pp. 86–114, doi:10.1016/j.artint.2017.01.001
-
[19]
P. Everaere, S. Konieczny & P. Marquis (2015): Belief Merging versus Judgment Aggregation. In: Proceed- ings of the 14th International Conference on Autonomous Agents and Multiagent Systems (AAMAS)
work page 2015
-
[20]
A. Gionis, H. Mannila & P. Tsaparas (2007): Clustering aggregation. ACM Transactions on Knowledge Discovery from Data 1(1), p. 4, doi:10.1145/1217299.1217303
-
[21]
U. Grandi & U. Endriss (2011): Binary Aggregation with Integrity Constraints. In: Proceedings of the 22nd International Joint Conference on Artificial Intelligence (IJCAI)
work page 2011
-
[22]
U. Grandi & U. Endriss (2013): Lifting Integrity Constraints in Binary Aggregation. Artificial Intelligence 199–200, pp. 45–66, doi:10.1016/j.artint.2013.05.001
-
[23]
E. Gregoire & S. Konieczny (2006): Logic-based approaches to information fusion. Information Fusion 7(1), pp. 4 – 18, doi:10.1016/j.inffus.2005.08.001. Logic-based Approaches to Information Fusion
-
[24]
F. van Harmelen, V . Lifschitz & B. Porter (2007):Handbook of Knowledge Representation. Elsevier Science, San Diego, USA
work page 2007
-
[25]
Hintikka (1962): Knowledge and Belief: An Introduction to the Logic of the Two Notions
J. Hintikka (1962): Knowledge and Belief: An Introduction to the Logic of the Two Notions. Cornell Univer- sity Press. F. Belardinelli & U. Grandi 67
work page 1962
-
[26]
B. Kimelfeld, P. Kolaitis & J. Stoyanovich (2018): Computational Social Choice Meets Databases . In: Proceedings of the 27th International Joint Conference on Artificial Intelligence (IJCAI) , doi:10.24963/ijcai.2018/44
-
[27]
Konieczny (2000): On the Difference between Merging Knowledge Bases and Combining them
S. Konieczny (2000): On the Difference between Merging Knowledge Bases and Combining them . In: Pro- ceedings of the Seventh International Conference on Principles of Knowledge Representation and Reasoning (KR)
work page 2000
-
[28]
Artificial Intelligence 157(1), pp
S Konieczny, J Lang & P Marquis (2004): DA2 merging operators. Artificial Intelligence 157(1), pp. 49 – 79
work page 2004
-
[29]
S. Konieczny & R. Pino P ´erez (2002): Merging Information Under Constraints: A Logical Framework . Journal of Logic and Computation 12(5), pp. 773–808, doi:10.1093/logcom/12.5.773
-
[30]
S ´ebastien Konieczny, J´erˆome Lang & Pierre Marquis (2002): Distance Based Merging: A General Frame- work and some Complexity Results. In: Proceedings of the Eights International Conference on Principles and Knowledge Representation and Reasoning (KR)
work page 2002
-
[31]
P. Liberatore & M. Schaerf (1998): Arbitration (or How to Merge Knowledge Bases). IEEE Transactions on Knowledge and Data Engineering 10(1), pp. 76–90, doi:10.1109/69.667090
-
[32]
Libkin (2015): How to Define Certain Answers
L. Libkin (2015): How to Define Certain Answers. In: Proceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI)
work page 2015
-
[33]
Mendelzon (1996): Merging Databases under Constraints
Jinxin Lin & Alberto O. Mendelzon (1996): Merging Databases under Constraints. International Journal of Cooperative Information Systems 7, pp. 55–76, doi:10.1142/S021821579200009X
-
[34]
D. Maier, J. Ullman & M. Vardi (1984): On the Foundations of the Universal Relation Model. ACM Trans- actions on Database Systems 9(2), pp. 283–308, doi:10.1145/329.318580
-
[35]
P. Maynard-Zhang & D. J. Lehmann (2003): Representing and Aggregating Conflicting Beliefs. Journal of Artificial Intelligence Research 19, pp. 155–203, doi:10.1613/jair.1206
-
[36]
M. Miller & D. Osherson (2009): Methods for distance-based judgment aggregation . Social Choice and Welfare 32(4), pp. 575–601, doi:10.1007/s00355-008-0340-x
-
[37]
D. Porello & U. Endriss (2012): Ontology merging as social choice: Judgment aggregation under the open world assumption. Journal of Logic and Computation 24, pp. 1229–1249, doi:10.1093/logcom/exs056
-
[38]
V . S. Subrahmanian (1992):Paraconsistent Disjunctive Deductive Databases. Theoretical Computer Science 93(1), pp. 115–141, doi:10.1016/0304-3975(92)90214-Z
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.