This project provides participants with 5000 gene sets that do not have any annotations. The goal of the project is to identify the global structure of this dataset based on gene set similarity and content. This project is open ended without specific solution or objective metric for evaluation. A two page report that describes the analysis is required for evaluation.