Biclustering is a powerful data analysis technique that simultaneously clusters the rows and columns of a data matrix. Since her PhD, Dr. Rosana Veroneze has worked on the development of enumerative biclustering algorithms with unique properties and functionalities that can be exploited in areas with high potential for contribution. To continue advancing the biclustering area and to produce high-impact research at the forefront of science, establishing national and international partnerships is of great importance. For this reason, Dr. Veroneze spent three months on a technical visit to the AIA research group at UCLouvain in 2019. The members of this group are pioneers in showing that constraint programming (CP) techniques can be used to model and solve a wide variety of data mining tasks, including constraint-based pattern mining tasks. Notably, they developed a basic CP model for specifying frequent itemsets and showed that this model can easily be extended to cover other settings of this problem. This contrasts with typical procedural data mining systems, in which the underlying procedures must be modified to accommodate new types of constraints, or novel combinations thereof, which is usually non-trivial. Moreover, the need to allow user-specified combinations of constraints is recognized in the data mining community. Biclustering problems, in turn, can also be seen as constraint optimization problems. Thus, the intention of this collaboration is (1) to study the deeper integration of the search and modeling strategies used in CP and biclustering; and (2) to scrutinize in detail which constraints or optimization criteria are interesting for biclustering tasks, in the context of new and existing applications.
Besides exploring constraints to select interesting patterns in supervised, semi-supervised and unsupervised scenarios, we will also explore constraints that automatically summarize the results in the form of a compact description that is relevant, non-redundant and easily interpretable. The results obtained during the technical visit are very promising and provide concrete starting points for this project.