TY - GEN
T1 - Distributed formal concept analysis algorithms based on an iterative MapReduce framework
AU - Xu, Biao
AU - De Fréin, Ruairí
AU - Robson, Eric
AU - Ó Foghlú, Mícheál
PY - 2012
Y1 - 2012
N2 - While many existing formal concept analysis algorithms are efficient, they are typically unsuitable for distributed implementation. Taking the MapReduce (MR) framework as our inspiration we introduce a distributed approach for performing formal concept mining. Our method has its novelty in that we use a light-weight MapReduce runtime called Twister which is better suited to iterative algorithms than recent distributed approaches. First, we describe the theoretical foundations underpinning our distributed formal concept analysis approach. Second, we provide a representative exemplar of how a classic centralized algorithm can be implemented in a distributed fashion using our methodology: we modify Ganter's classic algorithm by introducing a family of algorithms, namely MRGanter and MRGanter+ where the prefix denotes the algorithm's lineage. To evaluate the factors that impact distributed algorithm performance, we compare our algorithms with the state-of-the-art. Experiments conducted on real datasets demonstrate that MRGanter+ is efficient, scalable and an appealing algorithm for distributed problems.
AB - While many existing formal concept analysis algorithms are efficient, they are typically unsuitable for distributed implementation. Taking the MapReduce (MR) framework as our inspiration we introduce a distributed approach for performing formal concept mining. Our method has its novelty in that we use a light-weight MapReduce runtime called Twister which is better suited to iterative algorithms than recent distributed approaches. First, we describe the theoretical foundations underpinning our distributed formal concept analysis approach. Second, we provide a representative exemplar of how a classic centralized algorithm can be implemented in a distributed fashion using our methodology: we modify Ganter's classic algorithm by introducing a family of algorithms, namely MRGanter and MRGanter+ where the prefix denotes the algorithm's lineage. To evaluate the factors that impact distributed algorithm performance, we compare our algorithms with the state-of-the-art. Experiments conducted on real datasets demonstrate that MRGanter+ is efficient, scalable and an appealing algorithm for distributed problems.
KW - Distributed Mining
KW - Formal Concept Analysis
KW - MapReduce
UR - https://www.scopus.com/pages/publications/84864057128
U2 - 10.1007/978-3-642-29892-9_26
DO - 10.1007/978-3-642-29892-9_26
M3 - Conference contribution
AN - SCOPUS:84864057128
SN - 9783642298912
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 292
EP - 308
BT - Formal Concept Analysis - 10th International Conference, ICFCA 2012, Proceedings
T2 - 10th International Conference on Formal Concept Analysis, ICFCA 2012
Y2 - 7 May 2012 through 10 May 2012
ER -