Big Data and Analytics

Publications in Big Data:

  • Jordà Polo, Claris Castillo, David Carrera, Yolanda Becerra, Ian Whalley, Malgorzata Steinder, Jordi Torres, Eduard Ayguadé. Resource-aware Adaptive Scheduling for MapReduce Clusters. In the ACM/IFIP/USENIX 12th International Middleware Conference (Middleware 2011), Nov. 2011. Lisboa, Portugal. (pdf)
  • Jordà Polo, David Carrera, Yolanda Becerra, Vicenç Beltran, Jordi Torres, Eduard Ayguadé. Performance Management of Accelerated MapReduce Workloads in Heterogeneous Clusters. Accelerators. In the 39th International Conference on Parallel Processing (ICPP2010). San Diego, CA. September 2010 (pdf)
  • Jordà Polo, David Carrera, Yolanda Becerra, Jordi Torres, Eduard Ayguadé, Malgorzata Steinder, Ian Whalley. Performance-Driven Task Co-Scheduling for MapReduce Environments. In the 12th IEEE/IFIP Network Operations and Management Symposium (NOMS2010). April 19-23th, 2010, Osaka, Japan (pdf)
  • Yolanda Becerra, Vicenç Beltran, David Carrera, Marc González, Jordi Torres and Eduard Ayguadé. Speeding Up Distributed MapReduce Applications Using Hardware Accelerators. In the 38th International Conference on Parallel Processing (ICPP). Vienna, Austria. September 2009 (pdf)
  • Jordà Polo, Yolanda Becerra, David Carrera, Vicenç Beltran, Jordi Torres and Eduard Ayguadé. Towards Energy-EfÞcient Management of MapReduce Workloads . In the First international conference on energy-efficient computing and networking. (e-Energy 2010). Germany, April 2010. Poster. (pdf)

Research projects in Big Data:

  • IBM SOW project “Performance Management of Data-Analytic Programming Models in Cloud Environments” (2010-2011). The project will focus on the expansion of large scale data analytics and management challenges posed by the mix of heterogeneous applications in the scope of hybrid data centers.
  • BG/ASF joint research project (BSC – IBM) with the Scalable Data-centric Computing department at IBM TJ Watson research lab (Yorktown, NY).  Active Storage Fabrics (ASF) is a collection of components that surround a parallel in-memory database (PIMD). PIMD is a parallel client, parallel server, key/value object store. This research is part of the MareIncognito research framework between IBM and BSC.

Teaching courses that include Big Data topics:

Dissemination activities in Big Data: