Apache Spark is a full-fledged, data engineering toolkit that enables you to operate on large data sets without worrying about the underlying infrastructure. Spark is known for its speed, which is a result of improved implementation of MapReduce that focuses on keeping data in memory instead of persisting data on disk. However, in addition to its great benefits, Spark has its issues including complex deployment and scaling. How best to deal with these and other challenges and maximize the value you are getting from Spark? Drawing on experiences across dozens of production deployments, Pepperdata Field Engineer Alexander Pierce explores issues observed in a cluster environment with Apache Spark and offers guidelines on how to overcome the most common Spark problems you are likely to encounter. Alex will also accompany his presentation with demonstrations and examples. Attendees can use this information to improve the usability and supportability of Spark in their projects and successfully overcome common challenges. During this webinar, attendees will learn about: – Serialization and its role in Spark performance – Partition recommendations and sizing – Executor resource sizing and heap utilization – Driver-side vs. executor-side processing: reducing idle executor time – Using shading to manage library conflicts

Hora

18:30 - 19:00 hs GMT+1

Organizador

Pepperdata
Compartir
Enviar a un amigo
Mi email *
Email destinatario *
Comentario *
Repite estos números *
Control de seguridad
Julio / 2020 131 webinars
Lunes
Martes
Miércoles
Jueves
Viernes
Sábado
Domingo
Lun 29 de Julio de 2020
Mar 30 de Julio de 2020
Mié 01 de Julio de 2020
Jue 02 de Julio de 2020
Vie 03 de Julio de 2020
Sáb 04 de Julio de 2020
Dom 05 de Julio de 2020
Lun 06 de Julio de 2020
Mar 07 de Julio de 2020
Mié 08 de Julio de 2020
Jue 09 de Julio de 2020
Vie 10 de Julio de 2020
Sáb 11 de Julio de 2020
Dom 12 de Julio de 2020
Lun 13 de Julio de 2020
Mar 14 de Julio de 2020
Mié 15 de Julio de 2020
Jue 16 de Julio de 2020
Vie 17 de Julio de 2020
Sáb 18 de Julio de 2020
Dom 19 de Julio de 2020
Lun 20 de Julio de 2020
Mar 21 de Julio de 2020
Mié 22 de Julio de 2020
Jue 23 de Julio de 2020
Vie 24 de Julio de 2020
Sáb 25 de Julio de 2020
Dom 26 de Julio de 2020
Lun 27 de Julio de 2020
Mar 28 de Julio de 2020
Mié 29 de Julio de 2020
Jue 30 de Julio de 2020
Vie 31 de Julio de 2020
Sáb 01 de Julio de 2020
Dom 02 de Julio de 2020

Publicidad

Lo más leído »

Publicidad

Más Secciones »

Hola Invitado