Integración de Apache Spark y MongoDB con PySpark#

Objetivo del Notebook: Aprender a conectar Apache Spark con una base de datos MongoDB para realizar operaciones de lectura, análisis y escritura. Este es un patrón muy común en arquitecturas de Big Data, donde MongoDB se utiliza como un sistema de almacenamiento flexible y escalable (Operational Datastore) y Spark se usa para el procesamiento y análisis de datos a gran escala. ![azure-synapse-analytics-integrate-mongodb-atlas-architecture.svg](data:image/svg+xml;base64,<svg width="960" height="420" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink" overflow="hidden">
  <defs>
    <filter id="fx0" x="-10%" y="-10%" width="120%" height="120%" filterUnits="userSpaceOnUse" primitiveUnits="userSpaceOnUse">
      <feComponentTransfer color-interpolation-filters="sRGB">
        <feFuncR type="discrete" tableValues="0 0"/>
        <feFuncG type="discrete" tableValues="0 0"/>
        <feFuncB type="discrete" tableValues="0 0"/>
        <feFuncA type="linear" slope="0.470588" intercept="0"/>
      </feComponentTransfer>
      <feGaussianBlur stdDeviation="2 2"/>
    </filter>
    <clipPath id="clip1">
      <rect x="0" y="0" width="960" height="540"/>
    </clipPath>
    <clipPath id="clip2">
      <path d="M487.334 104.5C466.991 104.5 450.5 120.991 450.5 141.334L450.5 288.666C450.5 309.009 466.991 325.5 487.334 325.5L697.666 325.5C718.009 325.5 734.5 309.009 734.5 288.666L734.5 141.334C734.5 120.991 718.009 104.5 697.666 104.5ZM1.43051e-05 8.04663e-06 960 8.04663e-06 960 540 1.43051e-05 540Z" fill-rule="evenodd" clip-rule="evenodd"/>
    </clipPath>
    <image width="960" height="500" xlink:href="data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAA8AAAAIcCAMAAAAOgzdJAAAAAXNSR0IArs4c6QAAAARnQU1BAACxjwv8YQUAAAADUExURQAAAKd6PdoAAAABdFJOUwBA5thmAAAACXBIWXMAAA7DAAAOwwHHb6hkAAACDklEQVR4Xu3BMQEAAADCoPVPbQsvIAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAADgoAbrhQABeiFAsAAAAABJRU5ErkJggg==" preserveAspectRatio="none" id="img3"/>
    <clipPath id="clip4">
      <path d="M487.334 104.5C466.991 104.5 450.5 120.991 450.5 141.334L450.5 288.666C450.5 309.009 466.991 325.5 487.334 325.5L697.666 325.5C718.009 325.5 734.5 309.009 734.5 288.666L734.5 141.334C734.5 120.991 718.009 104.5 697.666 104.5ZM1.43051e-05 8.04663e-06 960 8.04663e-06 960 540 1.43051e-05 540Z" fill-rule="evenodd" clip-rule="evenodd"/>
    </clipPath>
    <clipPath id="clip5">
      <path d="M487.334 104.5C466.991 104.5 450.5 120.991 450.5 141.334L450.5 288.666C450.5 309.009 466.991 325.5 487.334 325.5L697.666 325.5C718.009 325.5 734.5 309.009 734.5 288.666L734.5 141.334C734.5 120.991 718.009 104.5 697.666 104.5ZM1.43051e-05 8.04663e-06 960 8.04663e-06 960 540 1.43051e-05 540Z" fill-rule="evenodd" clip-rule="evenodd"/>
    </clipPath>
    <clipPath id="clip6">
      <rect x="0" y="0" width="533400" height="419100"/>
    </clipPath>
    <image width="72" height="66" xlink:href="data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAEgAAABCCAYAAAD0dpAhAAAAAXNSR0IArs4c6QAAAARnQU1BAACxjwv8YQUAAAAJcEhZcwAADsMAAA7DAcdvqGQAAAnmSURBVHhe7Vx/jBRXHb/6s9S7/TUzdweeXG2R0mKKpv7A0IaEwu2bnZm9w9w1phV2Zu96glqtxmhpjPxTrTXa0qQF2zRSaW9n98DSghiD/RGxqIFaUqsn6lGLSIFCAYXrNdZr/XzffO9gu3u3e7uzYTfZT/IyM5958/Z9PvN9782bnZkGvzD7RiPc0hed3+wYS5WkuEm1o19XHfE9pA2KLTKqrf9KtcVebA9heRDbJxRHH1Ud/e0pk62/hXxv4Jj/KI44jvVhLmenLNfRf4w8d2pO9EvKymhcS8Q+NmvF9QpX68Liqp6e96GyQ6j4mRxhxSZbjEHofyFyFOLPYHka2yelITYZo7+V97gCCWWgLLGFq1pZRG4RASURXcGbWYAIzxxbnIaYP2P9SaRHwd8dsfXblYS+SnViN0ScKCJL/4SSMOeF7Y4PBXo7InNuEe/nYqbENf3979V6FjfOWtGlaCvF5VQOyl6mrozdgEjqj9jRNfjt9TBlO5YvgDvJRm3jIiYws9+8BPt7G3p63s1UeYg44ir80CH6QaoY0xNQkvFZkZtEgDerBlSnNpwE3pwAGeqZJ/5IJ4rp0kAFoPkc4QJ3K4mOebyrZoEo+yTSAXnCscxnYtFAxDzABW1vWLv4PUxnQYa1I7aqydh96Ji/rSb0m9GsujRHXBvpjV2JTrO1PbH4Ys7uL9BMghgc6HdocNBskUATux11vh91elyz9TWcMwvUvNHf/UFqc2I/Ynr6QL8i23JLn/VhprLQ1r1wBu0vJlGHCzNpFPoHRLwIbg+2fw0hO8HtIEHgfibNtvUnyHjwP8fyF+B/ifVdiOZ9WA6jrGPgR8bLnjTZ+u+5qjlo7hVXUx6UdZSp6aG5d0lLMQVEeo1PaXbss9pK/cs4Y3dAxIMkEsf9BgYPYfkKxLyeVXG/kq2/id/CSRR/QXoKaRM4OeQ3J/UuJRm9gquZF2Q0ldPy+WXNTBUP6n+4EgeZKgs0YjVhFAomopeGV0bnU4ePJnkdjW74HZ2uYahpKkkjDkMtNFmDeDRRoSVFBzVZzTEW0ChGgtq6u2dw0SWDtJHGkjprvw2qRtQNKoC6QQVQN6gA6gYVQN2gAqgbVAB1gwqg5gwKuNbqkGtcx5sVR00ZFEqZ3w+lrbeRRsMD5meYrihqxqBw2vyCNMc1xzyTzKNBt/NS3l0x1IRBoc3W1TDmdc8ga3XQtbbRejBtvjDzAfMSzlYRVL1Bwa2dIZjzkmeItU5yA0YYETQsOde4T2asEKreoIBrPuhFjvl8+8ZzN9Ua3diVMOysNCllLGfad1S1QYgSy4sS80xgkzWH6QnAoFvZvGPNKauFaV9RtQaFB3uC6GP+KQ3IxL/KdA5g4k428WGmfEXVGhRIWz+Q5qTNSW+JEmgkowhjk65n2jeUZVCzY1xGByu2PsyULwgPmrNhzKhnkLGA6UkRSllrZF7X3MeUbyBtpJG0MlU8cPASGUGOeJYpX4BIeJSjZyNTU6It0z0DQ/9BL4osm2lfQNo4CJYwVTwUR6yTBtniLqbKRmhLrB2R8D8IfSPsdhYd1jjmRs8gcz9TvoC0SYOglaniQCEHV+VfyvQwANNlI+TGf8iR8FOmisPate/CReQBGXkZI85s2SBtHEFnim5mrclYOzqv/d6B4idMlw26zkGzOiUNGpy+6ZirfUUa5JpPM+ULSKPXUvT9pJ3pc9C+2NMY6RNtkYRYiFD7LjKfkgc4YijcvzTI2cpGKBU3pMC0tZepaSE8uDQIc2hKMqYNxlqZLhukkbR6mvVT5EHEiX2aPCFvGtSE+A7vnEjINBhMdIa4DF+AC79NZFDAjd/G1LSB4x+jMiiamPIFpJU0v9MH8iavQQi39W1fK/+PufMx3oeEU9Z8prJAV8togsO4gNzOVA4wNfE667T5OFO+gLSS5hwfyCAlGW+iCyUtoS9CZ3X3+H/eWD7r10MHwY2YlJIw1xppGMz/XI4yYM6T0ZG2hpjKQcDtupzL+RdTZYM0klbWPEIekBfkCXnD2c5B7dPnjj+0gMz3MF0WwmlrkXfmrd8ylQPV1ed64qceyrH/BOXzqx8ijTJaSDO0Mz016HESHETPBY7ROtMlI5zujJIoNLMdTOUgkLHmyDxp629M5QWa158oXwQzfqZKhtQJjaR12jpVR2zgKLqDqZKBvkXO3EMZaytTOQgOGJd5BplTTm1g8h7Kh/zXMFUylIQctdHfiA1MFQ9+6gIjmr6LqZKBzrWDxT/FVA5oYirzuOZLTOUF9u+TBm2Of5ypkkHaSCNpZap4+DlZ/YAcoWT/8hpTOfAmsZTHepmpXKCDRx6a6I7N3Fb+7VjSRhpLmqz6fbuDRh4yIJw2PspUFmhuxgZN+nuh9PIFlKdQMywWpI00lnS7w3eD0uY6FvcQU1mIZIwPSoPS1iGmcoBj13t5zOlNLidBdRlEfQxm8hA5SrN6piegPrJ8Jht4mKks0LCO6JI3z5oGjI8wXRaqyiAC+qCHpQmu+Tu6z8O0hHclTfusI0xlga6wZfS45hNMlY2qM6hxwFQh1LsX7Vpbzu9oWzNCYwOOMTWBYNq4Vx6Tto5TU2S6bFSdQQQlE78CEfSqJ9jcO/4PatNjXQob96rMCHicOegZZ400pa2FvMsXVKVBBNXtmkvXO55w8zTSN5RUfBabdoLyBFPxbphyyOOs4zRdkQf7iKo1iNCIJgVj+P60jJxXeDkC8/aO8xjVdgcz+R9kLxdVbdA4It5Vtrw6Pj+hrzoczsRv5mwVQU0YNI7ggNklzXGtN4MpI1npBxcINWUQjWgyclzzLFMVR00Z1P5M4mJuXiNMVRw1ZRC9bgVznkSHnPOWYKVQWwZdANQNKoC6QQVQN6gA6gYVQFkGtaxa1kwHK7Yo7Z3OGgBpI42klanpAQ6/Jk1KmDX/Ovg7QZq4hUx6n7wgNFvc60WR/nxZ75dXGUgLaSJtpJHp6YNenkUYypfwVVu8rPaV/1/UhQZpkFrkiRcHSnrj+XxwZ/2c/AfSzr0XTJ+m8PMxGb8we7URzvcVGNLA/6Y+V1LnPAkuUpIi71OmCn2xhc6Go59F2P4d67tV7735h1CRuxRH3KYk9VVqUnwO+0zsW0yvhNOr4fJJNhjc6giNHkGh723Q0hPXpRDfsmp5M329gV4Hl6+FO+JalKPTB1NQflJNiFvpd9CXPAL+aUTFX6kuVCekvNMW1nKRt1VBjH8eB5X7N1eoipI4hZO0mat64UGPimi9HXO0Xn0RfSAAFezFmfymmtTvxPoGVDaFs7sdlX8GaQ9MfRH76Z/NQ4iyo0gnwZ2mJdIJGH8cxxzD+hHwhyk6sU6fptiFtAPraYpSpHu0pPiWkoitwPoS+tqCfDLMFzQ0/B8RahGGjyzGzwAAAABJRU5ErkJggg==" preserveAspectRatio="none" id="img7"/>
    <clipPath id="clip8">
      <rect x="0" y="0" width="457200" height="419100"/>
    </clipPath>
    <clipPath id="clip9">
      <rect x="0" y="0" width="476250" height="495300"/>
    </clipPath>
    <image width="51" height="53" xlink:href="data:image/jpeg;base64,/9j/4AAQSkZJRgABAQEAYABgAAD/2wBDAAMCAgMCAgMDAwMEAwMEBQgFBQQEBQoHBwYIDAoMDAsKCwsNDhIQDQ4RDgsLEBYQERMUFRUVDA8XGBYUGBIUFRT/2wBDAQMEBAUEBQkFBQkUDQsNFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBT/wAARCAA1ADMDASIAAhEBAxEB/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQAAAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWmp6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/8QAHwEAAwEBAQEBAQEBAQAAAAAAAAECAwQFBgcICQoL/8QAtREAAgECBAQDBAcFBAQAAQJ3AAECAxEEBSExBhJBUQdhcRMiMoEIFEKRobHBCSMzUvAVYnLRChYkNOEl8RcYGRomJygpKjU2Nzg5OkNERUZHSElKU1RVVldYWVpjZGVmZ2hpanN0dXZ3eHl6goOEhYaHiImKkpOUlZaXmJmaoqOkpaanqKmqsrO0tba3uLm6wsPExcbHyMnK0tPU1dbX2Nna4uPk5ebn6Onq8vP09fb3+Pn6/9oADAMBAAIRAxEAPwD9UPJT+4v5UeSn9xfyp9fP/wC0F+2T4R+Bt1PoqRzeIPFiKp/su2yiQlgCplkIwuQQcKGbkcDOa6sPhq2LqKlQjzP+vuOXEYqjhKfta8uVf1957te3FnptpNdXckFrawqXkmmYIiKOpLHgD6182+Nf28/h74X8TQaZp1rdeI7RZNt3qNkFWKMese7HmnP0GOhNeV6X8LPjZ+19eQ6p4+1GTwV4KZhJDpixmMuvUFLcnJP/AE0lOecgEcV9F6V8F/hL8E/BNzptzpGnmyuo/LuptURbi4vcc4JIyeedqgAHkAV9HSweX4OShim61R6csNl6y6vyXXc+XxGYY7EU5V6LVCjHVzqdfk9EvN/I7r4f/Ejwr8UtGXU/DGqWuqW3HmLHxJCT/DIh+ZD9RXUeSn9xfyr4Z8J/Bmxtfidc+IvhZZa7YxwgmK3+2bY4zySpYkfKeMI7sD3z0HWeFf29IvDvjKfwr8UvDGoeEriNwiX8kRJx0DSxAZAPUMm4HPpzRjchlCTeDfNpdxdudeqT1+X3Hl5Jxlh80lKFSDUU7KrZ+yk+vLJ2f4W8z668lP7i/lRUWn39vqthbXtnMlzaXMazQzRnKyIwBVge4IINFfJtNOzP0VNNXRYr410XS7LVP+ChevfbLSC78iATRefGH8uQWcOHXI4YdiOa+yq+PvC3/KQzxL/15n/0jhr6nIdIYz/rzL84nyuffHgv+v0fyZP8XPjdrln8QNVsI9SudMh0+4aCKCCUxjC9GOPvbuvPY1k+B9e8OeMvEzaj461bUrne37sY3REejODuC57Ko+tfTPjD4K+CfH2rRanrugQX1/GAPP3vGzAdA+xhvA/2s1d174ceFdW0X7HeaTZwWsEWyOSKNYmgUDjawxtAA+nFd9LPMHChCjGk4u1m42TXo+t/kfl+M4Bzatja2Onio1VzOcYVOdwe9lJXSVlorXVuljZ8PwaXb6PbLoyW6aaVzD9lx5ZHqMda+c/+CgWlWU/wd0+/ktIJL6DVoY4rlo1MkatHLuVWxkA4GQOuBXGaN8YL/wCH17eWOh6oZNNjuWKJMiusgBIB9sjGdpFdF+2R4kXxd+zHoWsrH5IvNStpDH/dPlzAge2Qa3weVVsvzbDVZS5oymrPr8/M9PA8X4TiLKMThIUnSq0oax05dHb3Gt0tN0t+u59A/CP/AJJT4L/7Atl/6ISij4R/8kp8F/8AYFsv/RCUV8Tiv94qer/M/XsJ/u9P/CvyOtr4t+M2l+Ofgf8AtE6n8XLDw6uveHrlFR2iZmEcfkRxuJNozGcoSGIK8j6V9pUjKHUqwyp4INduW5g8vqyk4KcZpxkn1TtfVbPTc4szy9ZjSjFTcJQalFro1to999jxTQf2rvDHiTwbFrNrY6hFeyfL/ZtxEUYNjr5n3Sn+0Mn27VhyeHvHvxzlWTU5z4f8NscrCFKh17EJnLn3bA9PStz40/s02fxG0uRvD2qSeFNV5bdbLiCc+jqMFfqpHXkNXhLfFr9oL4L/APFHatoZ8TX1z+40rU3ge6LH/ZkT/W8ZOH+YdTxxX1uDoYWvTdTKOVVr7VHrFf3b6O3fex+U5hhM2xGIcOJqjeFW0aKajP8A6+P4tf5VZX2PTfHv7PPwl8A2MOueJte1DSLOEfvEe8UC7YckBAhcsfSPH9a8Z+KXxV1b9qC1svh38NPBs/8Awj2nzRyLcSDawCKyIWOdkSYY/eJJ46Hiu38FfsgeKPiZrKeKPjLr11dXEnzDSYZg0gXrsdx8sa/7EY79Qa+rPCvhHRfBGjxaXoOmW2lafF92C2QKM+p7sT3JyTUTzLDZZKM3UeJrx2d37OL8usn57H0GEyOWKjKFChHC0JWTSS9pJLo/5V+PqReA9FuPDfgfw7pF2UN1Yadb2sxjOV3pEqtg9xkGit2ivgZzdSTnLd6n6VTgqcFCOy0CiiioLCiiigAooooAKKKKAP/Z" preserveAspectRatio="none" id="img10"/>
    <clipPath id="clip11">
      <rect x="0" y="0" width="476250" height="494926"/>
    </clipPath>
    <clipPath id="clip12">
      <rect x="0" y="0" width="1104900" height="495300"/>
    </clipPath>
    <image width="135" height="61" xlink:href="data:image/jpeg;base64,/9j/4AAQSkZJRgABAQEAYABgAAD/2wBDAAMCAgMCAgMDAwMEAwMEBQgFBQQEBQoHBwYIDAoMDAsKCwsNDhIQDQ4RDgsLEBYQERMUFRUVDA8XGBYUGBIUFRT/2wBDAQMEBAUEBQkFBQkUDQsNFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBT/wAARCAA9AIcDASIAAhEBAxEB/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQAAAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWmp6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/8QAHwEAAwEBAQEBAQEBAQAAAAAAAAECAwQFBgcICQoL/8QAtREAAgECBAQDBAcFBAQAAQJ3AAECAxEEBSExBhJBUQdhcRMiMoEIFEKRobHBCSMzUvAVYnLRChYkNOEl8RcYGRomJygpKjU2Nzg5OkNERUZHSElKU1RVVldYWVpjZGVmZ2hpanN0dXZ3eHl6goOEhYaHiImKkpOUlZaXmJmaoqOkpaanqKmqsrO0tba3uLm6wsPExcbHyMnK0tPU1dbX2Nna4uPk5ebn6Onq8vP09fb3+Pn6/9oADAMBAAIRAxEAPwD9U6KKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAwdW8TxJd6lo+my29z4ktrFb5NPkcBjG7OiPjPKlo3H1HbNfMXj741atoOoPbeIdavtMuf8An1SOSL8MKB+teFf8FI/Eeq+E/wBpDwzqei6ld6TqMHhuAxXdlM0Uqf6VddGUg1574e/4KBfF/R7FbPUNR0vxPbqMBda05JDj3Mewt9SSa+Pz/IKud8qhip0kukdn67fr6GVRVHH920n5q/6o+gf+F/af5mf7Z1LP9/Mn+Nd38PvjNrPiC/S38O6xfarODzbSRyS/gQ44H0rzP4JftYav8SLHVLrUvBHgm3ntZURHtNKdc5BJJ3SHnivSr347+LLi1+z2k9rpMHTZYWypj6E5I/Cvj8NwDXoTVSGOnFr+W9/zOSnDFRl+8nFryT/zPqbS9VMy2lre+VBq8lsJ5rWNt2zoGP03HH59cVpV85fs131zqXjbWrm7uJbq4kssvLM5dm/eL1Jr6Nr9ejFwiot3t1fU9AKKKKoAooooAKKKKACiiigAooooAKKKKAPy+/4Kh/8AJe/D/wD2LUH/AKVXVfHtfff/AAUO8D2fir4u6NNLJJBcR6FCgkTkFftFwcEH6mvm3SfhppGm288cytevMuxnl42j/ZA6H3619FhcixeKjGpGyjJXvf8Apnw2YcYZbl9SdGpzOpB2aS/G+34nUfso/wDIF8Qf9fEX/oJr3evI/g3oMXguy1aK1leZLiVHHm4yuARjjrXoUOpyxZDfvATnmr/sDGKLbSuul9zJ8aZVzxUXJxe7tt+v3XPoH9l7/kbNW/68f/ai19KV8u/soXj3XjLWQwCqLDgD/rotfUVeLisNUwtT2VTc+sy/H0cyoLEYf4W2tdNgooorkPSCiiigAooooAKKKKACiiigAooooA+H/wBua1mT4oaNctEwt5NHSNJSPlZlmmLAH1AZf++hXzjX6qeLPBmh+OtLbTte0y31OzJyI515U/3lYcqfcEGvDdf/AGH/AAbqEryaXqeqaSWPERdJ41+gYBvzY1+g5Xn2GoYeFCumnHruv8z8U4h4Nx+LxtTGYRqSm72bs19+n4nyR4M/1Fz/ALy/yNdHX0No/wCxPa6R5oHi2aVHIODYAEY9/MrrdF/ZQ8LWMiyX97qGpkdYy6xRn6hRu/Jq76ufYFNuMm/k/wBbHhUODc4laM6aj6yX6NnDfsi28reKtcnEbGFbII0mPlDGQEDPrgH8q+pqzdA8OaZ4W05LHSbGGwtF58uFcZPqT1J9zzWlXwGYYpYzESrJWTP2vJMtllOBhhZS5mrtv1dwooorzj3QooooAKKKKAP/2Q==" preserveAspectRatio="none" id="img13"/>
    <clipPath id="clip14">
      <rect x="0" y="0" width="1096156" height="495300"/>
    </clipPath>
    <clipPath id="clip15">
      <rect x="0" y="0" width="733425" height="409575"/>
    </clipPath>
    <image width="150" height="84" xlink:href="data:image/jpeg;base64,/9j/4AAQSkZJRgABAQEAYABgAAD/2wBDAAMCAgMCAgMDAwMEAwMEBQgFBQQEBQoHBwYIDAoMDAsKCwsNDhIQDQ4RDgsLEBYQERMUFRUVDA8XGBYUGBIUFRT/2wBDAQMEBAUEBQkFBQkUDQsNFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBT/wAARCABUAJYDASIAAhEBAxEB/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQAAAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWmp6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/8QAHwEAAwEBAQEBAQEBAQAAAAAAAAECAwQFBgcICQoL/8QAtREAAgECBAQDBAcFBAQAAQJ3AAECAxEEBSExBhJBUQdhcRMiMoEIFEKRobHBCSMzUvAVYnLRChYkNOEl8RcYGRomJygpKjU2Nzg5OkNERUZHSElKU1RVVldYWVpjZGVmZ2hpanN0dXZ3eHl6goOEhYaHiImKkpOUlZaXmJmaoqOkpaanqKmqsrO0tba3uLm6wsPExcbHyMnK0tPU1dbX2Nna4uPk5ebn6Onq8vP09fb3+Pn6/9oADAMBAAIRAxEAPwD9U6KKKAMnxR4ktfCujy390cqvypGp5kc9FH+egNeTW9543+JUzz2c72FhuIUxymGIe2R8z/r+FXPj1dSteaRag/ugjybfViQP6fqa9Z0vT4dJ062srdQsMEYjUAeg6/WvybFU8RxVnmJy2daVPDYZR5lB2c5TV9X2Vnp5een1dKVPK8FTxKgpVKl7N6qKWm3c8l/4Vf42/wChgj/8DZ//AImj/hV/jf8A6GCP/wADZ/8A4mvZKK9D/UDKf56v/gyRz/29i+0f/AUeN/8ACr/G/wD0MEf/AIGz/wDxNH/Cr/G//QwR/wDgbP8A/E17JRR/qBlP89X/AMGSD+3sX2j/AOAo8b/4Vf43/wChgj/8DZ//AImj/hV/jf8A6GCP/wADZ/8A4mvZKKP9QMp/nq/+DJB/b2L7R/8AAUeN/wDCr/G3/QwR/wDgbP8A/E0f8Kv8b/8AQwR/+Bs//wATXslFH+oGU/z1f/Bkg/t7F9o/+Ao8b/4Vf43/AOhgj/8AA2f/AOJo/wCFX+Nv+hgj/wDA2f8A+Jr2Sij/AFAyn+er/wCDJB/b2L7R/wDAUeNf8Kx8bx/MuvJuHTbfTA/+g0mk+PfEPgXV007xOktxat/y0k+Z1H95WH3x6g8/TpXs1cT8X9LgvvBV1PImZrRlkifuCWCkfQg/y9K8nM+FXkeEqZnkuJqQq0k5WlJyjJRV2pJ+S0/prrwuafXqscNjKcXGbtdKzTezTOygnjuoY5onEkUih0dTkMCMgipK434R3j3ngWxDks0LPFk+gY4/IED8K7Kv0vK8asywNDGpW9pGMrdrq9vkfN4qj9Wrzo/ytr7gooor1DlCiiigDxj47f8AIa0j/rk3/oQr2evGvjv/AMhbRv8Arm//AKEK9lr8y4d04jzr1o/+kSPpMw/5F2D9J/mgooor9NPmwooooAKKKKACiiigDkfGPxK0zwfcJbSpJdXjLuMMOPlHYsT0z6VN4N+IWm+MzLFbrJb3Ua7mgmxkr03AjqORXifxMdpPHWrljk+YB+ARQKv/AAdYr47tADgNHID7/ITX870OPczqcU/2e1H2Dq+z5ba25uW9979e3Sx+gzyLDRyz6wr8/LzX+V7WPoWuU+KX/Ihat/uJ/wCjFrq65T4pf8iFq3+4n/oxa/aeIP8AkT4z/r1U/wDSWfG5f/vlH/FH80Z/wX/5EiP/AK7yfzFd3XCfBf8A5EiP/rvJ/MV3dcfCf/IhwX/XuP5G2a/79W/xMKKKK+rPKCiiigDxr47/APIW0b/rm/8A6EK9lrxr47/8hbRv+ub/APoQr2WvzPh3/kpM69aP/pEj6TMP+Rdg/Sf5oKKKK/TD5sK5fx148tvBNnC7wtdXU5IihVtvAxkk4OByO1dRXinx5P8AxOtMHb7O3/oRr4jjPNsTkuS1sZhHaouVJ72u0r2em3c9rJ8LTxmNhRq/Drf5I3vCvxoi1rV4LC+sRZ/aGCRzRybhuPQEEdzxmvTa+UNH/wCQtY/9d0/9CFfVskiwxs7sERRuZmOAAOpNfL+HXEWPz3C4j+0Z80qbVpWS0aejsktLfienxDl9DA1af1dWUk9PQdRXndz8cNChvjCkN3PCrbTcIi7T7gE5xXeafqFvqllDd2somt5l3o69xX6Hl+eZbmtSdLA141JQ3Sf4+a81ofP4jA4nCxUq0HFPa585fEr/AJHrV/8ArqP/AEEVf+D3/I+WX/XOX/0A1Q+JX/I9av8A9dR/6CKv/B7/AJHyy/65y/8AoBr+SMH/AMlnH/sJ/wDch+sVf+RM/wDr3/7afQ1cp8Uv+RC1b/cT/wBGLXV1ynxS/wCRC1b/AHE/9GLX9acQf8ifGf8AXqp/6Sz8py//AHyj/ij+aM/4L/8AIkR/9d5P5iu7rhPgv/yJEf8A13k/mK7uuPhP/kQ4L/r3H8jbNf8Afq3+JhRRRX1Z5QUUUUAeMfHb/kNaR/1yb/0IV7PXjfx6hdNQ0e4x8hjdQfcEH+tevWd3Ff2cFzC2+GZFkRh3BGRX5jw6+XiXOYS0bdF/Llev4o+lzDXLcG1t7/5omoor5+8VfFTW7/WLgWV49jZxyMsUcOASAcZY9ST+Ve9xJxRg+GKMKuKTk5uyUbX03erSstPvOHLssrZlOUaTStu2fQNeJ/Hn/kOaZ/17n/0I11vwm8bXfiqxu7bUGEl3alSJgAN6tnGQO4I/UVyfx6/5DWmf9e7f+hV8RxlmmHzrg6WPwt+SbjvurTSafmmj2snwtTB5uqFXdX/I860j/kLWX/XdP/QhX0d8Q5Gj8E6yVJU/ZyOPQ8H9DXzjpH/IWsv+u6f+hCvoz4jnHgfWP+uP9RXxPh62skzdr+T/ANsme3xB/vuE9f1ifNFfQHwXkZvA8QJyFnkC+wzn+ZNfP9e//BX/AJElP+viT+leL4Wf8j6X/XuX5xOzif8A3Ff4l+p5N8Sv+R61f/rqP/QRV/4Pf8j5Zf8AXOX/ANANUPiV/wAj1q//AF1H/oIq/wDB7/kfLL/rnL/6Aa8PB/8AJZx/7Cf/AHIdtX/kTP8A69/+2n0NXK/FHnwHq3+4n/oa11Vcd8Wr6Ky8C3yuQHuCkUa+rbgf5An8K/rLiKcaeTYyU3Zezn/6Sz8qy5OWMopfzR/Mq/Bf/kSI/wDrvJ/MV3dcR8Hbd4fAtqzjAklkdfpux/Su3rm4Vi45Dgk1/wAu4fkjTNGnjq1v5n+YUUUV9UeWFFFFAGD408JweMNFkspG8qVT5kM2M7HH9D0NeU2PibxV8LlOn3tn59irHy/OBKDn+Bx2PofXoK90pK+JznhlZjio5hgq8sPiErc8dbrtKOif3+t7K3tYPMvq9J4etTVSm9bPo/J9Dn/A3ih/GGhLqD2wtW8xo9ituBxjkHA9a+abg7riU/7R/nX1sAFGAMCvky+jaG+uI2BVkkZSD2IJr8i8UqNejg8up4ip7ScedOVrXdoa2Wiv2PreGJwnWxEqceVO1le9t+p6f8A/+P8A1j/rlH/Nqh+PX/Ia0z/r3b/0KrHwDjb7XrMmPkCRLn3Jb/Cofj1Gw1bSnIOxoGAbsSG5/mPzrGaf/ENo6fa/9yspNf6xv0/9tPONH/5C1l/13T/0IV9GfEfnwPrH/XH+or510ONpta09EG52uI1AHclhX0Z8Q42k8E6yFBY/Z2bj0HJ/QVHh7FvI82st4/8Atkys/a+u4T1/VHzNXv8A8Ff+RJT/AK+JP6V4BX0D8GI2TwPEWBAeeRl9xnH8wa8TwsT/ALel/wBe5fnE7eJ/9xX+JfqeSfEr/ketX/66j/0EVe+D/wDyPlj/ALkv/oBqn8To2i8dasGGCZFYfQopFX/g3E0njq2ZRkRxSM3sNpH8yK8TCRl/rpFW1+s/+5Dsqtf2M3/07/8AbT0n4ifEafwVeWtvDYx3JnjL75HIA5xjAH9a4eLSfE/xY1SCfUUaz02M8PsKRovfYDyxPrz9a9yKg4JGSOlLX9JZnwtXzrFylj8ZKWGbTVJJRWltJSTvJX129GfnWGzSGDpJUKKVT+du/wBy6FfT7GHTLGC0t08uCFBGi+gAxViiivvYQjTioQVktEjwZScm29woooqxBRRRQAUUUUAFcF4w+GOjazffb2E9tcTOBJ9ncBXJ/iIIPP0oor4ri/D0cRlUvbQUrNNXSdtelz2soqTp4pckmr32Oo8N+GbDwrp4s7CMpHnczscu7epNM8TeFdP8Waf9lv42KqdySRnDofUH/HiiivTnhMP/AGO8P7OPs/Z/DZW2vttucsatT657TmfNzb3137nOeC/hro+iXX9oIJrm5jYiM3DAhPcAAc/Wu5dFkRkdQ6MMFWGQR6UUV5vCOGoYfKYKjBRTbbskru9tbb6HTm1Sc8XJzk3ax5vefB3QJNbQK13FDJ85hSUbR7DKk4/GvQ7Gxg02zhtbWJYbeFQiRr0AFFFefwrgsLhsTjZUKUYvntoktNdNFt5HTmlarUp0VOTenVmD4r+H+keL2SW9jkjuUG0T27BX2+hyCCPqKl8K+B9L8HxyCxjdppBh55m3Ow9M4AA+goorshgMIuIpV1Rjz8l+blV77Xva97aX7GLr1f7PUOd2va13Y6CiiivuDwwooooAKKKKAP/Z" preserveAspectRatio="none" id="img16"/>
    <clipPath id="clip17">
      <rect x="0" y="0" width="731384" height="409575"/>
    </clipPath>
    <clipPath id="clip18">
      <rect x="0" y="0" width="428625" height="428625"/>
    </clipPath>
    <image width="56" height="56" xlink:href="data:image/jpeg;base64,/9j/4AAQSkZJRgABAQEAYABgAAD/2wBDAAMCAgMCAgMDAwMEAwMEBQgFBQQEBQoHBwYIDAoMDAsKCwsNDhIQDQ4RDgsLEBYQERMUFRUVDA8XGBYUGBIUFRT/2wBDAQMEBAUEBQkFBQkUDQsNFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBT/wAARCAA4ADgDASIAAhEBAxEB/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQAAAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWmp6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/8QAHwEAAwEBAQEBAQEBAQAAAAAAAAECAwQFBgcICQoL/8QAtREAAgECBAQDBAcFBAQAAQJ3AAECAxEEBSExBhJBUQdhcRMiMoEIFEKRobHBCSMzUvAVYnLRChYkNOEl8RcYGRomJygpKjU2Nzg5OkNERUZHSElKU1RVVldYWVpjZGVmZ2hpanN0dXZ3eHl6goOEhYaHiImKkpOUlZaXmJmaoqOkpaanqKmqsrO0tba3uLm6wsPExcbHyMnK0tPU1dbX2Nna4uPk5ebn6Onq8vP09fb3+Pn6/9oADAMBAAIRAxEAPwD9U685+Lnx58K/BnTxNrl0ZL6Vc2+m2uHuJvcDoq/7TED6niug8f8AjS38E6GbqQq11KfLtYWP+skxn8gMk+wr8vfiVcaprXxF1671i5lvb17uQmWU5OzPyADsAuMAcAV9lw7kUc2quVeVqcfvfp5d3/S+M4kz6WUUlGhG85d9l/weyPrXx5+0l4u8XfC+PxR4CeHSjGS11A0K3E0ajhxlhtyvB+7ypzXzp/w1H8W1m83/AITG535zg20BX8vLx+lbP7Oup6ppviZtOt7aS90u8AF0gHyw9hKc8D0PqPUgV7FqH7NfhTw7eXGs3K/abCRvNEUkm23t8847ZXPTJxjjFfcRp5Zk9WeFxFGMr6xfKm2n0d+3S58JKpmmdUoYvD15Ra0kuZpJrqrdH1tscR8P/wBuLx/YXsVtrel23iqAn5/s8PkXOPUFBt/DZ+Ir7L+G/wATdM+Jmirf2MF5YSgfvbLUbdoZ4z7gjBH+0pI96+TdW+Lng7wbE1ppNr9q28eXp8CxxA/73AP1ANcTJ+1Zq2j36XelaJDayRHKtLcM+fYgKOD6V5uOyFZmufB4X2T73sn/ANu/5HqYHPnlT5MZivarta7X/b3+Z+i9Fef/AAP+LFl8ZPANpr9qgguc+Re2obJgnUDcv0OQwPowor8trUamHqSo1VaUXZo/U6FaniaUa1J3jJXTPnT46+KNU8ZftYeGvCWlK1xHpkIgaFT8oeaPzJZD7LH5ZP8AuH1r1C6/Y/8ABetXkt/q0t/c6hMqiSSGYRJkDGQuD2A6k9K8j8M67Bof7cnjA35CTXSSW9sz8YcxxMv5ohH419Qf8JL6v+tfYZpjK+XRwtHCtwXsou60vzav8T43K8Hh8yliq2KSm/ayVnrblsl+B5T4q8D6R8A/CqS6dBJcaeXEakgGWSUg48xgMc464wMYA6A+a+H/AIt3GqXU9j4ljjm0W6+RYgmUgB4wR/Evrnn+VevfHjxLbP8ADXULad1L3MkMcKk8lhIrcfgrV8t19FkdNZlg5VcUrzba5uvTVeZ5mbN5fio0sM7QS+Hp10fkaPxS+DsmgxvqujZvdFcb/wB2dzQg88nuvo35+p8rv/AHiJtGbVV0DU20rbuN8tnIYAPXft24/Gve/hL42uNN8Wado1xKk2lXUuxopuQjEEjb6ZbAx0Oa+pV8RqsewbQgGAo6Y9MV043iDFZJKOHqwVR7qV7XX3bnm4fhzC50pV6U3T6Wtez+/Y+P/wBhHxtLofxUvvDbyH7HrVozLGTx58ILgj/gHm/p6UVL4X8OweG/23tMg0q1a00+S8klgVUKphrRmlCdtoZnGBwMYor5Liz2dbGU8VSVvawjL81+SR9NwmqlHB1MLVd3SqSj+T/Ns6D9pL4X3P8Aw054e1aFXjstaSO5knjJG2S2UBxkdDsWLB9Wqf4kfHi58CaxZ6bbW0V9K8PmzGRyu0E4Ucd+D+lfWnjTwpB4s0ryHVFuYiXt5mHKPjHX0I4NfmL8YptStfip4gh1W2ls7qK5MQjmGD5agKjD2ZQGBHXOa+gyL2GfKnh8XFP2MbW762XntZfI8HPnXyB1MRhJNe2le/bTXy3v959Gzavpfxh0WOdbkrNFygH37diOQV7g4/HHWsbw/wDA3xb4lu54LRLWK1i4+3XE22Jj6DALZ9scflXztoXie80S8S6sbl7adejIcfgfUexr7I+A/wAZrfxV4Shs57iGPWbVnE0C/KXBYkOozyMHB9CD04r080p4rIMM54JJ077NX5b/ANficWU4nDZ9iVTxb5alt0/it09Twj4peBvEHwx1GGHVo1j83L293ayFo5CuM7WwCCOOCAehrLu/2jPGUOlG0TUIt+3b9qMIMuPr0z74zX0B8cfFXhDxVY2/h/WtVtY7mGcXG37QEeL5WXk9s7uh9B7V4xp3wV8LeLtWh07RtcuL68uDiO3t7uGRj6nhegHJPatsDjsNj8JCtmlHVXd3HT1V+j69DkzDB4rA4ydHK6+jsrKWvo7fh1PS/wBjHxFrPxE8YX15q1vHcQ6LDuXUMYczSAoFI6ZKeYcjHTpzRX0h8FfhHpnwZ8FxaHpxaWR3NxdXMhBeaVupJAHAACjgcAd80V+WZ1i6WOxs6uHVobL0X+b1P1HJcHWwOChSxErz3fq/8lod9XnPxe+AvhX40aesWt2rRX8SkW+p2uFuIfbPRl/2SCPoeaKK8yhXq4aoqtGTjJbNHrV8PSxVN0q0VKL6M+PfGn7C3j7w/cSP4fuLLxNZ/wAAWVbafHujnb+TmuD/AOGZ/ix5/lf8IZfb89RJFt/7634/WiivvsJxjmXJyzUZW6tO/wCDS/A/PcXwblvMpQco36Jq34pv8TvPBf7C/j7xBcRv4guLLwzZ9XDSi5nx7Kh2/m4r7C+EXwF8K/BjT2i0O1aW/lUC41O6w1xN7E9FX/ZUAfU80UV87mXEGPzRcladofyrRf5v5s+kyzh7L8rftKMLz/mer+XRfJHo1FFFfOH0p//Z" preserveAspectRatio="none" id="img19"/>
    <clipPath id="clip20">
      <rect x="-0.25" y="0" width="428625" height="428625"/>
    </clipPath>
    <clipPath id="clip21">
      <rect x="0" y="0" width="600075" height="419100"/>
    </clipPath>
    <image width="115" height="81" xlink:href="data:image/jpeg;base64,/9j/4AAQSkZJRgABAQEAYABgAAD/2wBDAAMCAgMCAgMDAwMEAwMEBQgFBQQEBQoHBwYIDAoMDAsKCwsNDhIQDQ4RDgsLEBYQERMUFRUVDA8XGBYUGBIUFRT/2wBDAQMEBAUEBQkFBQkUDQsNFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBT/wAARCABRAHMDASIAAhEBAxEB/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQAAAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWmp6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/8QAHwEAAwEBAQEBAQEBAQAAAAAAAAECAwQFBgcICQoL/8QAtREAAgECBAQDBAcFBAQAAQJ3AAECAxEEBSExBhJBUQdhcRMiMoEIFEKRobHBCSMzUvAVYnLRChYkNOEl8RcYGRomJygpKjU2Nzg5OkNERUZHSElKU1RVVldYWVpjZGVmZ2hpanN0dXZ3eHl6goOEhYaHiImKkpOUlZaXmJmaoqOkpaanqKmqsrO0tba3uLm6wsPExcbHyMnK0tPU1dbX2Nna4uPk5ebn6Onq8vP09fb3+Pn6/9oADAMBAAIRAxEAPwD9U6KSuW1bV5LmZkicrEpx8pxn3oA6b7RH/wA9F/76FKsqMcK6k+xr5T/aV+Fl7r/hu48Q+Hr29sdYsU82WK2uHRbiID5htBxuA5z7V8e6b4+8S6fMlzb+INTinjIZXF3Jwfzr63LchjmdF1aVazW6a2/E+dxubPA1fZ1Kd09nf/gH640V4V+yr8em+MHhaay1WRf+Ek0sBbggY8+M8LIB69j7/Wvda+cxWGqYOtKhVWqPaoV4YimqtPZhRRRXKdAUUVl65qZsogkZxK/6D1oA0WkSP7zqv1OKT7RF/wA9U/76FeXeMfDNr440S503UXnEcykCaGVo5Iz2ZWByCK/Nn4pab4w+F/jbUfD974g1RzbvmGb7ZJiWI8o33u4xXt5blscxbgqnLJdLdPvPLxuNlg0pOF0+tz9egQwyORS1+cH7Jf7Uet+C/GFn4e8T6pcal4c1GQQq93IZGtZGOFYMTnbngj3r9HlYMoIOQeQa58wy+rl9X2dTVPZ9zXB4yGMhzw0a3QtFFFeYd5S1i4+zafIwOGb5R+NeEfFjxldaPGlhYTGCZl3ySJ94DsBXs/iiXEcMfqSxr5b+IF6b/XL+XOR5hUfQcCvcyjDxr125q6ij5PiTGzwuFUKbtKbt8up2/wAI/HUni60vdI1RxcXcCZ3N1liPBz6kZx+NfE/xR8JnwR4+17R9pEdvcN5We8Z+ZT+RFe9/DDWW0f4paT82EuWa3YdjuBx+uK5b9sjRhY/Ea0v1GBe2SlvdkJB/TbX2eUWwmaSox0jUjf5r+meBOrLGZVCrUd5Qdr+T/pHzsviLVvDU0lzpGo3WmXDLtaS1lMbFc9CR2pv/AAuHxz/0Nusf+Bj/AONUtW6NXOV9TiaUJTvKKZwUZyjGyZ2H/C4fHP8A0Nusf+Bj/wCNH/C4fHP/AENusf8AgY/+NcfRXJ7Gl/KvuNvaT/mZ3Fr8YvHg3bPF2sD/ALfH/wAafJ8WvHMzbn8Waux9TdP/AI1y2kQ+aJMjOMVo/ZB6V0QwlKSvyL7kZvEVIu3M/vPXfgT+0d4o8I+OLCLW9YutV0O7lENzDdSF9gY43qT0IOPrXsH7c3gqO+0bQfFVum54X+xzSKM5jYFkJPoDkf8AAq+Qxa7SCBgivuvxRIPiP+yT9qmJeZdMjnOOT5kRBx+a185mGHhgcbh8VTVk3yv5/wBM9vB1pYrC1sPN3srr5HwYsJRgykqwOQR1FfrH+zj46f4h/Bvw5qs8vm3gtxbXLdzLH8rE/XGfxr8r/s49K+7v+CfOum48D+JNHY8Wd6k6A+kiEH9Y/wBa24nwqngva9Ytfc9P8jPI67jivZ9JL8tT6uooor8nP0E5fxdJtkH+zET/ADr5U15/Mkmb+8zH+dfU/jDPmSf9cTj9a+VdY+6/419bkK/iP0/U/OOLm+agv8X6HCQXRs/GOizA4K3sX6uBXV/tvWoz4Xuf4gJ4/wA9h/pXFXn/ACMWl4/5/Iv/AEMV6B+2yV/sXwyCPn82T8tor3qWmbYZr+9+RyYDXK8QvOP5nxlqveudb7xrotV71zrfeNfZYj4jkp7CUUUVympu+GY/MWf6itz7OKyPCrbVn+orf80elevQS9mjhqN8zK/2cV9q/APbqn7M95aSjcscN7Gc+mWYfzr4y80elfZH7NZP/DP+snHH+l4/74NfNcSpfU4NdJx/U9vJG/rMl3iz4z8qvrH/AIJ+3bR+K/FNn/A9kkp+okA/9mr5S8welfUH7ArM3xH8Qbfu/wBl/N/39Su7Pop5bW9P1Ry5S2sbT9f0Puqiiivww/VTmfFkW6Rf9qMr/OvlPxBH5Us6H+F2H6mvrbxRFuhhk/unH518vfEaxOn69fx4wrOXX6HmvqMinac4d0fAcW0m6dGquja+/wD4Y8us7M6h410SBRkvexcD2YH+ldL+27eDzvC9oPvbJ5D7coB/WrPwn0RtY+KGnybcx2e64ZvTAwP1Irif2wtbGo/E2OyVsrYWSIR6MxLH9CtfTYRe1zikl9mLf36Hm4VezyipJ/akl92p826r3rnW+8a6LVujVzlfXYj4jjp7BRRRXMam14fk8tZvqK2PtFc5psnlh6u/afevQpTtBI5pxvK5rfaK+2vgUy6L+y/dXs3yB7e8lbPoS4B/LFfCMczSSKiAszHAUdzX3h8Q4x8Nf2Tf7OdcTNp0NmR0O+QgMf1Jr5nP6ntY0MOt5TX4f8Oe5lMfZurW6Ri/6/A+KPOr63/4J8WTy+JvFd9/BHZxwH6s+7/2Wvjn7R7198/8E+fD7Wfw717WXXB1C+WJSe6xL1/Nz+VdXEVZQy6ou9l+Jhk9LmxsH2u/wPqqiiivxg/TCpqlt9qsZUAy2Mj6ivEvib4Hn8SwR3Niqm8jGwoxxvX6+or3iub1bRJFmaW3TejclV6g10UK88PUVSnujjxeEpY2i6FVaM8l+G3gdfh/pN9qGosi3kqF5mB4ijUZxn9TXwj8QPFD+M/GWt6y/wDy93LuoznC5wo/AYr7F/aS1TxreaDN4Y8JeF9YvZbsbLu/htX8tI+6I2OSe5HAFfLum/s7/Ee+mjtl8IanG8hChpoSij3LHgCv0jh3kh7TG4qaUp7Xa2/r8j4rNKfJCngsNBuMPLqcN4a+GPin4pX1zp/hXSJdXu4Y/NkSN0TYucZJYgfrXQf8Ma/GP/oSrj/wLt//AI5X6C/s4/AmD4J+E3inaO516+IkvbhBwMdI19h+pr12vLx/Ek/rElhknBbN31/E9TCZHD2KddtSfRW/yPyd/wCGNfjH/wBCVcf+Bdv/APHKP+GNfjH/ANCVcf8AgXb/APxyv1iorz/9ZMV/JH8f8zr/ALDw/wDM/wAP8j8prP8AYx+MUgbPhF4f+ul5Bz+Tmm3X7IHxZs5Nkvhva2M/8fcX/wAVX6t1n6xpv9oQjbgSr90+vtT/ANZcX/LH7n/mP+w8P/M/w/yPzv8Agj+yF4qXxxY6h4ws49P0ixkE5h85XadgcqoCk4GepNdN+3Z46jUaD4Ugk+dSb65Vf4eCsY/Vjj6V9V+MtQvvCeiXF7Fouoavcop8qzsIGkeRuw46D3Nfnj4++FXxh+Ifi3Udf1PwRrjXV5Ju2/ZHwi9FUcdAMD8K7cuxM8wxaxeLkkobLbX+v0OTG0YYPDvDYdNuW/oeVwmS4mjiiVpJZGCqq8kknAAr9cfgL4Fb4cfCXw3ociBLqG2Elxj/AJ6v8z/qT+VfJX7J/wCx7rq+KbTxZ4509tNsrB/NtdMuB+9mkH3Wdf4VB5weSRX3rWXEWZQxLjh6Tulq/U1ybBSoqVaorN6L0Ciiivij6cKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigD//Z" preserveAspectRatio="none" id="img22"/>
    <clipPath id="clip23">
      <rect x="0" y="0" width="595018" height="419100"/>
    </clipPath>
    <clipPath id="clip24">
      <rect x="0" y="0" width="533400" height="428625"/>
    </clipPath>
    <clipPath id="clip25">
      <rect x="0" y="0" width="467591" height="428625"/>
    </clipPath>
    <linearGradient x1="6.21522" y1="99.3641" x2="6.21522" y2="105.645" gradientUnits="userSpaceOnUse" spreadMethod="pad" id="fill26">
      <stop offset="0" stop-color="#C69AEB"/>
      <stop offset="1" stop-color="#6F4BB2"/>
    </linearGradient>
    <linearGradient x1="6.21522" y1="99.3641" x2="6.21522" y2="105.645" gradientUnits="userSpaceOnUse" spreadMethod="pad" id="fill27">
      <stop offset="0" stop-color="#C69AEB"/>
      <stop offset="1" stop-color="#6F4BB2"/>
    </linearGradient>
    <clipPath id="clip28">
      <rect x="0" y="0" width="1857375" height="971550"/>
    </clipPath>
    <image width="213" height="113" xlink:href="data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAMEAAABmCAYAAAByW/XKAAAAAXNSR0IArs4c6QAAAARnQU1BAACxjwv8YQUAAAAJcEhZcwAADsMAAA7DAcdvqGQAAA8ASURBVHhe7Z0JlBXFFYZ7FAUUXFAEJQiDC4KgERWCURE0UQLCyKLRqOgxouz7royCRz0GjZG44B5AxBHFGBUGRDAoSPCgwai4hESNogIzwIC4wEy+v6n3zmMGZsgjAR75v3P+U9W3qrp7ePd2V1V3NZExxhhjjDHGGGOMMcYYY4wxxhhjjDHGGGOMMcYYY4wxxhhjjDHGGGOMMcYYY4wxxhhjjDHGGGOMMcYYY4wx6dCvX79Dhg4d2mjAgAFN+/Tpc1Awm+1RfMGxlTd0aHjUxk7HNUhHhR0b1l/ZoWH13LlRlbEzo7q5s6MG6WrCm9F+4bTKI4sftuagQYPqSX379q3VtWvX/UNZGSjbF2eokaivtnl5efuqjHzjgQMH5gwZMuSouHKGM2HChP34G9ugB9HL/G1zUftQbLbHug5HN16bk/3AupwGC9PUrPUXZbcbMzs6JXdmlDc6P1qYrsbOieqF09ouubm5lfiBb0GvB03mytciFJcBJ6hLENyZqM/2raRHDB8+/FDsb5D/iPS+Vq1aVQpNMpbBgwe3lOPzN31LuhEVkr8+FMcXhJA1qazJyW6+tmP2fJy5JE0VrO/U4KqbZ0WtCIIPc2dFJenqhllRo3Ba26V79+778eM+zY9bIpFfT9pXwRGqJCkpKcmirA11vk7UR9PoKvyof//+R+L8P1C2Gb2qq2holpHwdx2IhqHv0Seom+50PXv2rK2LBAEyDtuD+vcLTUyCTA+CoKk4dYNQJcmwYcMOpuw2VJxSNw4CgmYf9pNDIPyGtqeFJhkLTl6bv+de/Y2kean/HtimoO/QAv7u7XYd/2/J1CBAuoKvQWvRv/iB24YqSXDwY7EvC3VWkU8Ggcq5Ulbu1q1bldJXRwWInAX7AaQHoWqqK7vuLipTO9JKEsepqro0zUq0V32OdSDjjepKE+1VnoL2FbdXPfLVUBXZtxTHxHVk56peTfV0rNRzVjeHq319jvNQ+Bsn62/XOUoEyPRgX8R5HKRzcdcohQwOgk380G+QvsCPW0z+Jl35Q7XE2OFqyhUsC8i/GBwhGQTk/4Rm4UDD4kZAuwPYV2vaPI4UXN+zvY78S6Q/YSzRgPw4pL53dxxsFPZ30ec6Pu2rYW9L+VNqj76hbCXpDBw4h7IDw3H20XlgH4r+hv0bVET+L4mBuupgq0f7/qQLSRX036CP0XjaN6LO/pS1RRrvfE6qO8EKtABpYKzzTHQHdTGYxzk/Rb6NjmEgw4NgKlI/WI4hR28cqsVdIWxzUAG6C00IjpAaBIXYv2Mfj2mbtCq2bugj7JtJvyB9B70rG7oGBzqJ7dnk5WzLSL8klfOtHjly5JHkeymPvkXqm8vBlX5P2XdoqByXY9UgfzuSXU66hPSv6Ct0PKeTpS4N+Ty0CSkAdFd7j7qrSYuRguEMgqYrqY6xFum8iki1/XeJbY2b9LdrwLycVJMEF+pvNpDhQfAYP6YGvm+hjeQvTnQT2D4T6Ud/Cyf5BWmFQYCDa3ZFwSQHewN7T9Qc+0+Vx94mNQiQrqyPk15Dqiv6meQVFDqXfNpcFvZ5KdJV+QfsG7jztESax38fm+4092FvgtM3I+2tad/wd+YincsK6txJ+XmUn832KNIvkBx+NjqB/NVoZrDNpf61pDlBC2VHH6AubJ+f+DcwkOlBQHo4P+yDpJrpmdirV6/DVI+8nFNB8Ciqi8oNAvWd2R6AdFf5HFtnXbFVL5ClY5cKAnVxjgvlupNo6jbhbO2COYY2P8NWEMrvxtGbYFuu46OH0QmJABbqu6ucuprtmarACEXqJqkLpL9ZXT1Ng55OgOgu9AB57f8P5OuG6jr2M8G+sNTfZESmBwGmLPJX8QN/SrpCTko+G6nLoCtoZ/JHkC83CORkbP9edbC9hNQlKUNqEJCO5Yp+SCiSs+UH+zz2WSOYYzjvA7B/FsrnyGnJy1nV7YrHDKTD1AXS4JsgOEZlSOc4KuwmCXV/iV2Bv55zutJBsBPsBUEQyXH4gWchDZCHoZsoV36BnBF7hUHAGOJo8hNDnSnaVp3SlAqCoaTxQFewvSi0z8fZyjy3oFxXdpW/rnLSU7E9TLoy2FdzLvkcQwFwimykX2PrG3aRhDINhjWe2EDa3UGwE+wNQaDXINi+GWlg+B52DVS/JR2hcvIVBkGqE5HOUZ9ddUqTGgRoCEoNAl3NZZ+fmOFJoCfU2BP9+Gdk03mzrXM7l3OYQqoBrQbBt3P8+qQaD6yjbEy8kxSop3GI7gSaUcpxEOwEe0MQCJyzJT/ye9jVvZDzqP8dv4ZBWmEQyDnYB5tbuhiU6YlrZdUTlO+DY1cvLwjYzw2yU65uT7dgjmFb07XrSPXgrrfuBKiaysL7TXWwv4a03xeRngB/QBvVV3BlxzuCMH6ZqTL05YgRI2pVEATTgn0x/37JaWQT2FuCQE6KI0zDrhmYEvJPhqIdCgJtkzbHNh/J8TZQdj9O3xnn/zXbf0RbTZGi0ncCvae0QmXkP0G3oq5sa8Acz9VT/g+1IW2KXkWa4j2H/V5LqilN7fduBQlpT9rL0RWYc1WH9FfoBaQBtcrG6tjlBYH2F+yaulX+YtQkFJsMDYInUBEONCGYY8KP+xllG3Dc1sGsmZaa2MZTpgdSU2hXR3byeg6wJrEfBRLbeqC1mHQ9UtdEg1M54WfUu5wyzeroAZ32NYA2elIcE7o37Sj/ECmIUtvrgdtSnPVE1Q37iWd/Qh3V1R1oCYrvYD169DiU+r9jW0+7NdOlerrT6RmExg+P6N9DdQmQ2tjuQTqvhxKBLvi3OI26GnckjqXjXhyKTaYFgboO/OCXo9tL/5CaqcE2gh95bOprAXrdACfojP0Oyq9IPFlm+2a2b8PRttoPtibs/0bKnyOvq/40tgezjzq64pLVFfkOjncOQbBVH1vHVXvqjiHV6wqzqKunx4PVNlTTGOEw6uhJsMYCeqYwHY3m3OpTnHxtQl0wyvWO0wTSGUEPyJb6N6oe++qE/Q50SeqsFWgGrQPn/SR1NPAeT/1mocwUXnRMs3Uds19Aq9LUx0U5DS4lAM4YnR8tQqvSFUGQnHPfQ8jSdGXIp8UOtt/RY+zUuQT+G/vYuyi6MLvWug7ZXdZ2qN83Ha3pWP+6VR2ObXzLy1EdAuHK3Pyob7q6bX50aDgtY3YhD6+sHk0sODmaVHBWWnpidctoSlGtktnRwcWzotOKZ0RnpauS56Nk/9qYXceUohOjyQWP4NBL0tLkwnmkHUryo1M3z4yeK54ZLUlb+VFyGtCYXccTa5rjyPNRSVqaVFBAehV3gVYEwYcEQ0m6Yh8VDoy3hWZCGOidi9prwKdZnlBkzA6Q4UEghx+wZa5d7/6vIv+mHiaFYmN2gAwPAhw/e+DAgc+S6mFQLAIh+YzAmIrJ7CDIwuEvIgj0pFWvFOuBlJ72TlHZliq7HB23zLF1x3I3bU8lg4NAD4gIgltwfD0FfQZNIgD0QlnR4MGDa4dqMXqySlk97CdtS+ynKWltvVKNTmLfDVMfhCXe71Fd7YfteA1yv379Gob2WilWl7IWpKewXVVOT/7wsO/z+vfvf57qjhgxombYrdkjyOAgwLlOxuleQVqd1R+HO5e8xgbqFg0I1WJw6qMon0Q9vSZRRpRpOeIA0uFIaxEWs52crdJTZurdGerG7+ZQrlehF8lGfjTSmmW9mrA4BEZT8r9FeqFOr0XoLrWK9F6C5z/u+pn/FRkaBFxl9ZWGy3AqdYM+wMlbs10VB9MyRr1YtkhX6lA98ZqCXql4PiHaaQ3CJonthVypz6KO1iJoe5kWtoTm8SsZlGkZpF6Oe1rrDdhuTl5rfjUO+SepXpXWC3iTsbfHplQr1bR2WKvHtBpM7xXp8yeTE6vgzO4mQ4MAJ9KbofF3dpC6QfFLZ1yBMW95YQ0nTA6QFTRh9ZhWnWXj8FrErveP5PBaTnl1CKy0ggCt5Nha4XY6Nq01HkZaSJ330WV6f4n9V8F2CWXav94y9ScS9wgyNAhwIPW7tZa3iLtAPxws7r/jiMdg0+IadUvujytvA9q2olzdFF2pxyVeqsOeVhCQTky80DZy5EiNHR4Ndt112pNvIeluw7a6T9tcNml2BxkYBHoOgANehxNp7cBa8s/jXOMk8nfJwZD630txXr2VuRW008L7P1MuJ30FZ0++dkz7dIOgT6iquo2wvSw7+lp1kD7dEgtbvCwSjQtNzG4lA4MA56yJA+qdfjlf/H59cKqEZIu7KKh7aBYTVmXpq80bqafuysmhKKa8IKBNeWOCa0JVLXA5Efs82ZG+O6QBsxblpCoPJduY3UnmBUEWXY0f42RyfH3fR19e04BzK2FXN0fO/KymM0Nb3QV6oa8o151C6wi2mtNn3zdi15X6E8YQyVe72danXaaiCoMgdMniTx9SPk5jEQKqcmlpDBKamN1KhgVBGLzG3/fBwd7B4X4eiraCOvoIbXwlpt6Z4QvVZ6O3Zafd45o9KiU9S+iN4i+8kXbhePoeqQa0Z6D4e6ak5QYBZZpOHUOZnl8spVyLb+JvnpLuTwAc1Lt3b78suMeQYUEgB8KxNM2oLs90trf54Imydkh3As3r6+qu/6RDV3JNicqRNZX6XIqeJjCuxHk1YNY3TuXA6s+PRxpn6Km0vgZRYRAI7Qe9SbnOQcsZ72FbzzL0VYw55KeHqma3s2UtQV40uWB5eip8O5pU2KU4P2qxeUb0SvHMaHnamhMl++DbA2fSE1l9ue0dHGlQMJch9OEXqi717ifV9zr1yUK13ZaWEQT6TqjGDFrgrm8Iab2yPm6rb4BqiaTm+lX3Xj18Y796WKflk2XW7Kq7Q7mWdGp55qdI30TVrNCXaCn5eIG82RN4ak0NHLk1d4SuaWnyuo7Rk4X1i1+KauLIF2yaEXVNV8WvRdXDWW0XzbzgQHLo9uTLzPwkwJkrhenIrjhjaw1WSS8IbcsIx+xEeVO1Dd0izfdr3v961DYsem+mugRLS/avr1frVYnzQ/sy3Rvq6LWJ4ym7glRPokfRtofOhyCq8G81xhhjjDHGGGOMMcYYY4wxxhhjjDHGGGOMMcYYY4wxxhhjjDHGGGOMMcYYY4wxxhhjjDHGGGOMMcYYY4wxxmQ8UfRv005FC7r6QhYAAAAASUVORK5CYII=" preserveAspectRatio="none" id="img29"/>
    <clipPath id="clip30">
      <rect x="0" y="0" width="1831329" height="971550"/>
    </clipPath>
  </defs>
  <g clip-path="url(#clip1)">
    <rect x="0" y="0" width="960" height="540" fill="#FFFFFF"/>
    <g clip-path="url(#clip4)">
      <use width="100%" height="100%" xlink:href="#img3" transform="translate(1.43051e-05 8.04663e-06)"/>
    </g>
    <g fill="none" clip-path="url(#clip2)"/>
    <g clip-path="url(#clip5)">
      <g filter="url(#fx0)" transform="translate(443 99)">
        <g>
          <path d="M7.50006 44.3341C7.50006 23.9912 23.9912 7.50005 44.334 7.50005L254.666 7.50005C275.009 7.50005 291.5 23.9912 291.5 44.3341L291.5 191.666C291.5 212.009 275.009 228.5 254.666 228.5L44.334 228.5C23.9912 228.5 7.50006 212.009 7.50006 191.666Z" stroke="#000000" stroke-linejoin="round" stroke-miterlimit="10" fill="#FFFFFF" fill-rule="evenodd"/>
        </g>
      </g>
    </g>
    <path d="M450.5 141.334C450.5 120.991 466.991 104.5 487.334 104.5L697.666 104.5C718.009 104.5 734.5 120.991 734.5 141.334L734.5 288.666C734.5 309.009 718.009 325.5 697.666 325.5L487.334 325.5C466.991 325.5 450.5 309.009 450.5 288.666Z" stroke="#000000" stroke-linejoin="round" stroke-miterlimit="10" fill="#FFFFFF" fill-rule="evenodd"/>
    <path d="M223.02 144 264.095 144.814 264.055 146.814 222.98 146ZM262.821 141.788 270.74 145.946 262.662 149.787Z"/>
    <rect x="28.5001" y="21.5001" width="195" height="247" stroke="#000000" stroke-linejoin="round" stroke-miterlimit="10" fill="none"/>
    <text fill="#3C4043" font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="13" transform="translate(59.4826 118)">Document data model</text>
    <text fill="#3C4043" font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="13" transform="translate(67.8159 150)">Distributed systems</text>
    <text fill="#3C4043" font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="13" transform="translate(90.6492 166)">architecture</text>
    <text fill="#3C4043" font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="13" transform="translate(65.2359 198)">Cloud | On</text>
    <text fill="#3C4043" font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="13" transform="translate(127.903 198)">-</text>
    <text fill="#3C4043" font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="13" transform="translate(133.236 198)">premises</text>
    <g clip-path="url(#clip6)" transform="matrix(0.000104987 0 0 0.000104987 35 33)">
      <g clip-path="url(#clip8)" transform="matrix(1.16667 0 0 1 -0.03125 0)">
        <use width="100%" height="100%" xlink:href="#img7" transform="scale(6350 6350)"/>
      </g>
    </g>
    <rect x="271.5" y="65.5" width="38" height="161" stroke="#21313C" stroke-width="1.33333" stroke-linejoin="round" stroke-miterlimit="10" fill="none"/>
    <text fill="#21313C" font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="700" font-size="15" transform="matrix(-1.83697e-16 -1 1 -1.83697e-16 294.272 211)">Change stream API</text>
    <g clip-path="url(#clip9)" transform="matrix(0.000104987 0 0 0.000104987 660 173)">
      <g clip-path="url(#clip11)" transform="matrix(1 0 0 1.00075 -0.5 -0.125)">
        <use width="100%" height="100%" xlink:href="#img10" transform="scale(9338.24 9338.24)"/>
      </g>
    </g>
    <g clip-path="url(#clip12)" transform="matrix(0.000104987 0 0 0.000104987 457 173)">
      <g clip-path="url(#clip14)" transform="matrix(1.00798 0 0 1 0 -0.125)">
        <use width="100%" height="100%" xlink:href="#img13" transform="scale(8119.67 8119.67)"/>
      </g>
    </g>
    <g clip-path="url(#clip15)" transform="matrix(0.000104987 0 0 0.000104987 757 132)">
      <g clip-path="url(#clip17)" transform="scale(1.00279 1)">
        <use width="100%" height="100%" xlink:href="#img16" transform="scale(4875.89 4875.89)"/>
      </g>
    </g>
    <path d="M0.0150754-0.999886 55.9772-0.156137 55.9471 1.84364-0.0150754 0.999886ZM54.6892-3.1759 62.628 0.944252 54.5686 4.82319Z" transform="matrix(1 0 0 -1 309 145.944)"/>
    <path d="M417.595 144.196 489.937 197.742 488.747 199.349 416.405 145.804ZM490.65 194.537 494.701 202.512 485.891 200.967Z"/>
    <path d="M0.696421-0.717633 65.172 61.8521 63.7791 63.2874-0.696421 0.717633ZM66.3045 58.7707 69.2598 67.2126 60.7331 64.5118Z" transform="matrix(1 -1.22465e-16 -1.22465e-16 -1 417 144.213)"/>
    <text font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="12" transform="translate(379.527 188)">Event</text>
    <text font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="12" transform="translate(369.447 202)">publisher</text>
    <text font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="15" transform="translate(454.162 45)">Metadata of the change</text>
    <text font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="12" transform="translate(489.286 246)">Delta data</text>
    <path d="M530 76 604.268 76C604.82 76 605.268 76.4477 605.268 77L605.268 172.759 603.268 172.759 603.268 77 604.268 78 530 78ZM608.268 171.425 604.268 179.425 600.268 171.425Z"/>
    <path d="M532.004 208 564.298 208.131 564.29 210.131 531.996 210ZM562.977 205.125 570.961 209.157 562.945 213.125Z"/>
    <path d="M626.015 203 663.758 203.562 663.729 205.562 625.985 205ZM662.47 200.543 670.409 204.661 662.351 208.542Z"/>
    <text font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="12" transform="translate(592.325 235)">Azure</text>
    <text font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="12" transform="translate(585.745 249)">Synapse</text>
    <text font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="12" transform="translate(584.159 263)">Analytics</text>
    <text font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="12" transform="translate(586.579 278)">pipeline</text>
    <text font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="12" transform="translate(652.42 235)">SQL pools/</text>
    <text font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="12" transform="translate(652.42 249)">Spark pools</text>
    <text font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="19" transform="translate(534.518 354)">Synapse Studio</text>
    <text font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="12" transform="translate(612.242 128)">Custom</text>
    <text font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="12" transform="translate(614.909 142)">trigger</text>
    <text font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="12" transform="translate(531.848 182)">Storage</text>
    <text font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="12" transform="translate(534.595 196)">trigger</text>
    <path d="M0.757634-0.65268 27.745 30.6744 26.2297 31.9797-0.757634 0.65268ZM29.1477 27.7062 31.3386 36.378 23.0866 32.9276Z" transform="matrix(1 -1.22465e-16 -1.22465e-16 -1 734 187.378)"/>
    <path d="M734.679 189.266 763.074 215.512 761.717 216.981 733.321 190.734ZM764.132 212.404 767.291 220.772 758.701 218.279Z"/>
    <text font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="12" transform="translate(777.199 124)">BI tools</text>
    <text font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="12" transform="translate(772.256 271)">Atlas as sink</text>
    <path d="M0.00769948-0.99997 26.0656-0.799332 26.0502 1.20061-0.00769948 0.99997ZM24.7554-3.80951 32.7244 0.251969 24.6938 4.19025Z" transform="matrix(1 -1.22465e-16 -1.22465e-16 -1 825 227.252)"/>
    <rect x="861.5" y="213.5" width="16" height="24" stroke="#595959" stroke-linejoin="round" stroke-miterlimit="10" fill="#EEEEEE"/>
    <rect x="875.5" y="203.5" width="17" height="24" stroke="#595959" stroke-linejoin="round" stroke-miterlimit="10" fill="#EEEEEE"/>
    <path d="M874.5 224.5 891.193 224.5" stroke="#595959" stroke-linejoin="round" stroke-miterlimit="10" fill="none" fill-rule="evenodd"/>
    <path d="M882.5 225.5C882.5 224.395 883.396 223.5 884.5 223.5 885.605 223.5 886.5 224.395 886.5 225.5 886.5 226.605 885.605 227.5 884.5 227.5 883.396 227.5 882.5 226.605 882.5 225.5Z" stroke="#595959" stroke-linejoin="round" stroke-miterlimit="10" fill="#EEEEEE" fill-rule="evenodd"/>
    <text font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="12" transform="translate(850.721 271)">Real</text>
    <text font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="12" transform="translate(873.221 271)">-</text>
    <text font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="12" transform="translate(878.055 271)">time apps</text>
    <g clip-path="url(#clip18)" transform="matrix(0.000104987 0 0 0.000104987 371 122)">
      <g clip-path="url(#clip20)">
        <use width="100%" height="100%" xlink:href="#img19" transform="matrix(7654.02 0 0 7654.02 -0.25 0)"/>
      </g>
    </g>
    <g clip-path="url(#clip21)" transform="matrix(0.000104987 0 0 0.000104987 573 180)">
      <g clip-path="url(#clip23)" transform="matrix(1.0085 0 0 1 0 -0.125)">
        <use width="100%" height="100%" xlink:href="#img22" transform="scale(5174.07 5174.07)"/>
      </g>
    </g>
    <path d="M273 39.5C273 30.3873 280.387 23 289.5 23 298.613 23 306 30.3873 306 39.5 306 48.6127 298.613 56 289.5 56 280.387 56 273 48.6127 273 39.5Z" fill="#107C10" fill-rule="evenodd"/>
    <text fill="#FFFFFF" font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="15" transform="translate(285.752 45)">1</text>
    <path d="M592 297.5C592 288.387 599.611 281 609 281 618.389 281 626 288.387 626 297.5 626 306.613 618.389 314 609 314 599.611 314 592 306.613 592 297.5Z" fill="#107C10" fill-rule="evenodd"/>
    <text fill="#FFFFFF" font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="15" transform="translate(605.169 303)">4</text>
    <path d="M671 268.5C671 259.387 678.611 252 688 252 697.389 252 705 259.387 705 268.5 705 277.613 697.389 285 688 285 678.611 285 671 277.613 671 268.5Z" fill="#107C10" fill-rule="evenodd"/>
    <text fill="#FFFFFF" font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="15" transform="translate(683.936 274)">5</text>
    <path d="M736 121.5C736 112.387 743.611 105 753 105 762.389 105 770 112.387 770 121.5 770 130.613 762.389 138 753 138 743.611 138 736 130.613 736 121.5Z" fill="#107C10" fill-rule="evenodd"/>
    <text fill="#FFFFFF" font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="15" transform="translate(749.203 127)">6</text>
    <path d="M434 82.5C434 73.3873 441.611 66 451 66 460.389 66 468 73.3873 468 82.5 468 91.6127 460.389 99 451 99 441.611 99 434 91.6127 434 82.5Z" fill="#107C10" fill-rule="evenodd"/>
    <text fill="#FFFFFF" font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="15" transform="translate(443.549 88)">3a</text>
    <path d="M736 255.5C736 246.387 743.611 239 753 239 762.389 239 770 246.387 770 255.5 770 264.613 762.389 272 753 272 743.611 272 736 264.613 736 255.5Z" fill="#107C10" fill-rule="evenodd"/>
    <text fill="#FFFFFF" font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="15" transform="translate(749.115 261)">6</text>
    <path d="M377 98C377 88.6112 384.387 81 393.5 81 402.613 81 410 88.6112 410 98 410 107.389 402.613 115 393.5 115 384.387 115 377 107.389 377 98Z" fill="#107C10" fill-rule="evenodd"/>
    <text fill="#FFFFFF" font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="15" transform="translate(389.687 103)">2</text>
    <path d="M444 201.5C444 192.387 451.611 185 461 185 470.389 185 478 192.387 478 201.5 478 210.613 470.389 218 461 218 451.611 218 444 210.613 444 201.5Z" fill="#107C10" fill-rule="evenodd"/>
    <text fill="#FFFFFF" font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="400" font-size="15" transform="translate(452.672 207)">3b</text>
    <text fill="#3C4043" font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="700" font-size="13" transform="translate(94.2829 55)">MongoDB Atlas on</text>
    <text fill="#3C4043" font-family="Segoe UI,Segoe UI_MSFontService,sans-serif" font-weight="700" font-size="13" transform="translate(104.95 71)">Azure as source</text>
    <g clip-path="url(#clip24)" transform="matrix(0.000104987 0 0 0.000104987 770 199)">
      <g clip-path="url(#clip25)" transform="matrix(1.14074 0 0 1 -0.5 0)">
        <use width="100%" height="100%" xlink:href="#img7" transform="scale(6494.32 6494.32)"/>
      </g>
    </g>
    <g>
      <g>
        <path d="M28.559 93.0254 28.559 91.2327 15.1847 91.2327 8.91027 97.5426 4.29539 97.5426 15.0072 86.5912 22.3112 86.5912 22.3112 84.8872 14.2618 84.8872 1.77495 97.5426 0 97.5426 0 99.3354 5.36036 99.3354 11.6881 105.645 26.7841 105.645 26.7841 103.853 12.3448 103.853 7.87192 99.3354 18.7346 99.3354 18.7346 97.5426 11.4573 97.5426 15.8947 93.0254 28.559 93.0254Z" fill="#32BEDD" fill-rule="evenodd" transform="matrix(1.00301 0 0 1 491.84 -18.5722)"/>
        <path d="M5.14975 103.042C5.14975 104.479 3.99694 105.644 2.57487 105.644 1.15281 105.644 6.90633e-09 104.479 6.90633e-09 103.042 6.90633e-09 101.605 1.15281 100.44 2.57487 100.44 3.99694 100.44 5.14975 101.605 5.14975 103.042Z" fill="#FFFFFF" fill-rule="evenodd" transform="matrix(1.00301 0 0 1 509.829 -23.1782)"/>
        <path d="M5.14975 103.042C5.14975 104.479 3.99694 105.644 2.57487 105.644 1.15281 105.644 6.90633e-09 104.479 6.90633e-09 103.042 6.90633e-09 101.605 1.15281 100.44 2.57487 100.44 3.99694 100.44 5.14975 101.605 5.14975 103.042Z" fill="#FFFFFF" fill-rule="evenodd" transform="matrix(1.00301 0 0 1 512.53 -35.7998)"/>
        <path d="M5.14975 103.042C5.14975 104.479 3.99694 105.644 2.57487 105.644 1.15281 105.644 6.90633e-09 104.479 6.90633e-09 103.042 6.90633e-09 101.605 1.15281 100.44 2.57487 100.44 3.99694 100.44 5.14975 101.605 5.14975 103.042Z" fill="#FFFFFF" fill-rule="evenodd" transform="matrix(1.00301 0 0 1 518.794 -29.489)"/>
        <path d="M5.14975 103.042C5.14975 104.479 3.99694 105.644 2.57487 105.644 1.15281 105.644 6.90633e-09 104.479 6.90633e-09 103.042 6.90633e-09 101.605 1.15281 100.44 2.57487 100.44 3.99694 100.44 5.14975 101.605 5.14975 103.042Z" fill="#FFFFFF" fill-rule="evenodd" transform="matrix(1.00301 0 0 1 516.122 -16.8673)"/>
        <path d="M6.21522 102.505C6.21522 104.239 4.82389 105.645 3.10761 105.645 1.39132 105.645 4.81412e-08 104.239 4.81412e-08 102.505 4.81412e-08 100.77 1.39132 99.3641 3.10761 99.3641 4.82389 99.3641 6.21522 100.77 6.21522 102.505Z" fill="#50E6FF" fill-rule="evenodd" transform="matrix(1.00301 0 0 1 511.907 -35.2615)"/>
        <path d="M6.21522 102.505C6.21522 104.239 4.82389 105.645 3.10761 105.645 1.39132 105.645 4.81412e-08 104.239 4.81412e-08 102.505 4.81412e-08 100.77 1.39132 99.3641 3.10761 99.3641 4.82389 99.3641 6.21522 100.77 6.21522 102.505Z" fill="url(#fill26)" fill-rule="evenodd" transform="matrix(1.00301 0 0 1 518.171 -28.9506)"/>
        <path d="M6.21522 102.505C6.21522 104.239 4.82389 105.645 3.10761 105.645 1.39132 105.645 4.81412e-08 104.239 4.81412e-08 102.505 4.81412e-08 100.77 1.39132 99.3641 3.10761 99.3641 4.82389 99.3641 6.21522 100.77 6.21522 102.505Z" fill="url(#fill27)" fill-rule="evenodd" transform="matrix(1.00301 0 0 1 515.588 -16.329)"/>
        <path d="M6.21522 102.505C6.21522 104.239 4.82389 105.645 3.10761 105.645 1.39132 105.645 4.81412e-08 104.239 4.81412e-08 102.505 4.81412e-08 100.77 1.39132 99.3641 3.10761 99.3641 4.82389 99.3641 6.21522 100.77 6.21522 102.505Z" fill="#50E6FF" fill-rule="evenodd" transform="matrix(1.00301 0 0 1 509.295 -22.6398)"/>
        <path d="M1.68621 94.0992 5.50236 94.0992 5.50236 104.776C5.50236 105.255 5.11809 105.643 4.64407 105.643L0.860853 105.645C0.386832 105.645 0.00256129 105.257 0.00256129 104.778L0 95.8032C0.000470042 94.8625 0.755147 94.0992 1.68621 94.0992Z" fill="#999999" fill-rule="evenodd" transform="matrix(1.00301 0 0 1 484.27 -39.7478)"/>
        <path d="M1.68621 94.0992 5.50236 94.0992 5.50236 104.776C5.50236 105.255 5.11809 105.643 4.64407 105.643L0.860853 105.645C0.386832 105.645 0.00256129 105.257 0.00256129 104.778L0 95.8032C0.000470042 94.8625 0.755147 94.0992 1.68621 94.0992Z" fill="#999999" fill-rule="evenodd" fill-opacity="0.5" transform="matrix(1.00301 0 0 1 484.27 -39.7478)"/>
        <path d="M0 94.0992 3.81615 94.0992C4.74721 94.0992 5.50189 94.8625 5.50236 95.8032L5.50236 104.776C5.50236 105.255 5.11809 105.643 4.64407 105.643L0.825354 105.645C0.362929 105.629-0.00361739 105.246-0.00334223 104.778L0 94.0992Z" fill="#999999" fill-rule="evenodd" transform="matrix(1.00301 0 0 1 529.213 -39.7478)"/>
        <path d="M0 94.0992 3.81615 94.0992C4.74721 94.0992 5.50189 94.8625 5.50236 95.8032L5.50236 104.776C5.50236 105.255 5.11809 105.643 4.64407 105.643L0.825354 105.645C0.362929 105.629-0.00361739 105.246-0.00334223 104.778L0 94.0992Z" fill="#999999" fill-rule="evenodd" fill-opacity="0.5" transform="matrix(1.00301 0 0 1 529.213 -39.7478)"/>
        <path d="M50.3111 101.909 50.3111 105.645 0 105.645 0 101.909C0.000470042 100.968 0.755147 100.205 1.68621 100.205L48.6604 100.205C49.5776 100.225 50.3108 100.982 50.3111 101.909Z" fill="#949494" fill-rule="evenodd" transform="matrix(1.00301 0 0 1 484.27 -45.8492)"/>
        <path d="M0.860853 94.0105 4.67701 94.0105C5.15103 94.0105 5.53529 94.3992 5.53529 94.8775L5.53786 105.645 1.68621 105.645C0.755147 105.645 0.000470021 104.882-2.11591e-08 103.941L0 94.969C-0.0490442 94.4924 0.293405 94.0664 0.764883 94.0167 0.795913 94.0131 0.827095 94.0114 0.858292 94.0123Z" fill="#999999" fill-rule="evenodd" transform="matrix(1.00301 0 0 1 484.27 -6.01038)"/>
        <path d="M0.860853 94.0105 4.67701 94.0105C5.15103 94.0105 5.53529 94.3992 5.53529 94.8775L5.53786 105.645 1.68621 105.645C0.755147 105.645 0.000470021 104.882-2.11591e-08 103.941L0 94.969C-0.0490442 94.4924 0.293405 94.0664 0.764883 94.0167 0.795913 94.0131 0.827095 94.0114 0.858292 94.0123Z" fill="#999999" fill-rule="evenodd" fill-opacity="0.5" transform="matrix(1.00301 0 0 1 484.27 -6.01038)"/>
        <path d="M0.825354 94.0992 4.65038 94.0992C5.1244 94.0992 5.50867 94.488 5.50867 94.9663L5.50236 103.941C5.50189 104.882 4.74721 105.645 3.81615 105.645L0 105.645 0 95.0577C-0.0492378 94.5811 0.293039 94.1543 0.764496 94.1046 0.785836 94.1028 0.80725 94.101 0.828696 94.101Z" fill="#999999" fill-rule="evenodd" transform="matrix(1.00301 0 0 1 529.213 -6.10011)"/>
        <path d="M0.825354 94.0992 4.65038 94.0992C5.1244 94.0992 5.50867 94.488 5.50867 94.9663L5.50236 103.941C5.50189 104.882 4.74721 105.645 3.81615 105.645L0 105.645 0 95.0577C-0.0492378 94.5811 0.293039 94.1543 0.764496 94.1046 0.785836 94.1028 0.80725 94.101 0.828696 94.101Z" fill="#999999" fill-rule="evenodd" fill-opacity="0.5" transform="matrix(1.00301 0 0 1 529.213 -6.10011)"/>
        <path d="M0 103.941 0 100.205 50.3111 100.205 50.3111 103.941C50.3107 104.882 49.5559 105.645 48.6249 105.645L1.68621 105.645C0.755147 105.645 0.000470021 104.882-2.11591e-08 103.941Z" fill="#949494" fill-rule="evenodd" transform="matrix(1.00301 0 0 1 484.27 -6.01038)"/>
      </g>
    </g>
    <g clip-path="url(#clip28)" transform="matrix(0.000104987 0 0 0.000104987 -6 327)">
      <g clip-path="url(#clip30)" transform="scale(1.00902 1)">
        <use width="100%" height="100%" xlink:href="#img29" transform="scale(8597.79 8597.79)"/>
      </g>
    </g>
  </g>
</svg>
) Aprenderemos a:

  1. Configurar el entorno: Iniciar una sesión de Spark con el conector oficial de MongoDB.

  2. Conectar con MongoDB Atlas: Usaremos una base de datos en la nube para un ejemplo realista.

  3. Poblar MongoDB: Insertaremos datos de ejemplo directamente desde el notebook.

  4. Leer datos: Cargar una colección de MongoDB en un DataFrame de Spark.

  5. Analizar datos: Utilizar el poder de Spark SQL y las operaciones de DataFrame para procesar la información.

  6. Escribir datos: Guardar los resultados de nuestro análisis de vuelta en una nueva colección en MongoDB.

1. Configuración del Entorno#

Esta es la parte más importante. Necesitamos tres cosas:

  1. Una base de datos MongoDB: Usaremos el servicio gratuito de MongoDB Atlas.

  2. Las librerías necesarias: pyspark para Spark y pymongo para interactuar con MongoDB.

  3. El Conector de Spark para MongoDB: Un paquete que le permite a Spark comunicarse con MongoDB.

1.1. Pasos para Configurar MongoDB Atlas (¡Acción Requerida!)#

Si no tienes una cuenta, sigue estos pasos (toma 5-10 minutos):

  1. Ve a MongoDB Atlas y regístrate.

  2. Crea un clúster gratuito (Tier M0). Puedes elegir cualquier proveedor de nube y región.

  3. Crea un usuario de base de datos: En la sección «Database Access», crea un usuario. Guarda bien el nombre de usuario y la contraseña.

  4. Configura el acceso de red: En la sección «Network Access», agrega tu dirección IP actual o permite el acceso desde cualquier lugar (0.0.0.0/0 - solo para este tutorial, no es seguro para producción).

  5. Obtén la cadena de conexión: Ve a «Database», haz clic en «Connect» en tu clúster, selecciona «Connect your application» y copia la cadena de conexión. Se verá algo así: mongodb+srv://<username>:<password>@clustername.mongodb.net/.

# =================================================================
# 1.2. Instalar las dependencias
# =================================================================
!pip install pyspark pymongo -q
?25l   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 0.0/1.7 MB ? eta -:--:--
   ━━╺━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 0.1/1.7 MB 3.6 MB/s eta 0:00:01
   ━━━━━━━━━━━━╸━━━━━━━━━━━━━━━━━━━━━━━━━━━ 0.6/1.7 MB 8.2 MB/s eta 0:00:01
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╸ 1.7/1.7 MB 16.6 MB/s eta 0:00:01
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.7/1.7 MB 14.1 MB/s eta 0:00:00
?25h?25l   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 0.0/331.1 kB ? eta -:--:--
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 331.1/331.1 kB 22.5 MB/s eta 0:00:00
?25h
# =================================================================
# 1.3. Iniciar la Sesión de Spark con el Conector de MongoDB
# =================================================================
from pyspark.sql import SparkSession

# Reemplaza esta versión si es necesario. Búscala en el repositorio de Maven.
mongo_spark_connector_version = "3.0.1"
mongo_spark_connector = f"org.mongodb.spark:mongo-spark-connector_2.12:{mongo_spark_connector_version}"

spark = SparkSession.builder\
    .appName("SparkMongoDBIntegration")\
    .config("spark.jars.packages", mongo_spark_connector)\
    .getOrCreate()

print("SparkSession iniciada con el conector de MongoDB.")
SparkSession iniciada con el conector de MongoDB.

2. Poblando MongoDB con Datos de Ejemplo#

Para que nuestro notebook sea autocontenido, vamos a insertar algunos datos en nuestra base de datos de Atlas usando la librería pymongo. Esto también nos sirve para verificar que nuestra cadena de conexión es correcta.

import pymongo

# =================================================================
# ¡ACCIÓN REQUERIDA!
# Reemplaza la cadena de conexión con la tuya de MongoDB Atlas.
# Asegúrate de poner tu usuario y contraseña.
# =================================================================
# =================================================================
# ¡ACCIÓN REQUERIDA!
# Reemplaza la cadena de conexión con la tuya de MongoDB Atlas.
# =================================================================
MONGO_URI = ""
# Definimos el nombre de nuestra base de datos y colección
DB_NAME = "universidad"
COLLECTION_NAME = "cientificos"

# Datos de ejemplo que vamos a insertar
cientificos_data = [
    {"nombre": "Albert", "apellido": "Einstein", "campo": "Física", "nacimiento": 1879},
    {"nombre": "Marie", "apellido": "Curie", "campo": "Química", "nacimiento": 1867},
    {"nombre": "Isaac", "apellido": "Newton", "campo": "Física", "nacimiento": 1643},
    {"nombre": "Charles", "apellido": "Darwin", "campo": "Biología", "nacimiento": 1809},
    {"nombre": "Rosalind", "apellido": "Franklin", "campo": "Química", "nacimiento": 1920}
]

# Conectamos a MongoDB y poblamos la colección
try:
    client = pymongo.MongoClient(MONGO_URI)
    db = client[DB_NAME]
    collection = db[COLLECTION_NAME]

    # Limpiamos la colección por si ya existían datos
    collection.delete_many({})

    # Insertamos los nuevos datos
    collection.insert_many(cientificos_data)

    print(f"✅ Datos insertados correctamente en la colección '{COLLECTION_NAME}'.")
    client.close()
except Exception as e:
    print(f"❌ Error al conectar o insertar datos: {e}")
✅ Datos insertados correctamente en la colección 'cientificos'.

3. Leer Datos de MongoDB con Spark#

Ahora que tenemos datos en MongoDB, vamos a usar Spark para leerlos y cargarlos en un DataFrame. Un DataFrame es una tabla de datos distribuida con columnas nombradas.

# Leemos los datos desde MongoDB a un DataFrame de Spark
df_cientificos = spark.read.format("mongo")\
    .option("uri", MONGO_URI)\
    .option("database", DB_NAME)\
    .option("collection", COLLECTION_NAME)\
    .load()

# Mostramos el esquema inferido por Spark
print("Esquema del DataFrame:")
df_cientificos.printSchema()

# Mostramos los datos cargados
print("\nDatos cargados desde MongoDB:")
df_cientificos.show()
Esquema del DataFrame:
root
 |-- _id: struct (nullable = true)
 |    |-- oid: string (nullable = true)
 |-- apellido: string (nullable = true)
 |-- campo: string (nullable = true)
 |-- nacimiento: integer (nullable = true)
 |-- nombre: string (nullable = true)


Datos cargados desde MongoDB:
+--------------------+--------+--------+----------+--------+
|                 _id|apellido|   campo|nacimiento|  nombre|
+--------------------+--------+--------+----------+--------+
|{68ccb513e7510f01...|Einstein|  Física|      1879|  Albert|
|{68ccb513e7510f01...|   Curie| Química|      1867|   Marie|
|{68ccb513e7510f01...|  Newton|  Física|      1643|   Isaac|
|{68ccb513e7510f01...|  Darwin|Biología|      1809| Charles|
|{68ccb513e7510f01...|Franklin| Química|      1920|Rosalind|
+--------------------+--------+--------+----------+--------+

4. Analizar los Datos con Spark#

Una vez que los datos están en un DataFrame de Spark, podemos usar todo su poder para el análisis. Realizaremos un análisis simple: encontrar a los científicos nacidos en el siglo XIX y agruparlos por campo.

from pyspark.sql.functions import count

# Filtramos los científicos nacidos en el siglo XIX (entre 1801 y 1900)
df_siglo_xix = df_cientificos.filter(
    (df_cientificos.nacimiento > 1800) & (df_cientificos.nacimiento <= 1900)
)

print("Científicos nacidos en el siglo XIX:")
df_siglo_xix.show()

# Agrupamos por campo y contamos cuántos hay en cada uno
df_conteo_por_campo = df_siglo_xix.groupBy("campo").agg(
    count("*").alias("numero_de_cientificos")
)

print("\nConteo de científicos del siglo XIX por campo:")
df_conteo_por_campo.show()
Científicos nacidos en el siglo XIX:
+--------------------+--------+--------+----------+-------+
|                 _id|apellido|   campo|nacimiento| nombre|
+--------------------+--------+--------+----------+-------+
|{68ccb513e7510f01...|Einstein|  Física|      1879| Albert|
|{68ccb513e7510f01...|   Curie| Química|      1867|  Marie|
|{68ccb513e7510f01...|  Darwin|Biología|      1809|Charles|
+--------------------+--------+--------+----------+-------+


Conteo de científicos del siglo XIX por campo:
+--------+---------------------+
|   campo|numero_de_cientificos|
+--------+---------------------+
|  Física|                    1|
| Química|                    1|
|Biología|                    1|
+--------+---------------------+

5. Escribir los Resultados de Vuelta a MongoDB#

Finalmente, guardaremos el resultado de nuestro análisis (el DataFrame df_conteo_por_campo) en una nueva colección en MongoDB.

# Definimos el nombre de la nueva colección para los resultados
RESULTS_COLLECTION_NAME = "conteo_por_campo"

# Escribimos el DataFrame de resultados en MongoDB
# El modo "overwrite" borrará la colección si ya existe y la creará de nuevo.
df_conteo_por_campo.write.format("mongo")\
    .option("uri", MONGO_URI)\
    .option("database", DB_NAME)\
    .option("collection", RESULTS_COLLECTION_NAME)\
    .mode("overwrite")\
    .save()

print(f"✅ Resultados guardados en la colección '{RESULTS_COLLECTION_NAME}'.")

# Verificación final usando pymongo para confirmar que los datos se escribieron
try:
    client = pymongo.MongoClient(MONGO_URI)
    db = client[DB_NAME]
    results_collection = db[RESULTS_COLLECTION_NAME]
    print("\nVerificando los datos escritos en MongoDB:")
    for doc in results_collection.find():
        print(doc)
    client.close()
except Exception as e:
    print(f"❌ Error al verificar los datos: {e}")

# Detenemos la sesión de Spark
spark.stop()
✅ Resultados guardados en la colección 'conteo_por_campo'.

Verificando los datos escritos en MongoDB:
{'_id': ObjectId('68ccb54c98f4a667c551597b'), 'campo': 'Física', 'numero_de_cientificos': 1}
{'_id': ObjectId('68ccb54c98f4a667c551597c'), 'campo': 'Química', 'numero_de_cientificos': 1}
{'_id': ObjectId('68ccb54c98f4a667c551597d'), 'campo': 'Biología', 'numero_de_cientificos': 1}

#Ejemplo con datos reales

Vamos a reemplazar los datos de los científicos con un conjunto de datos público muy popular: Títulos de Netflix. Este dataset contiene información sobre películas y shows de TV, su tipo, país de origen, año de lanzamiento, etc. Es perfecto para realizar agregaciones y filtros realistas.

Integración de Apache Spark y MongoDB con PySpark: Caso Práctico con Datos de Netflix#

Objetivo del Notebook: Aprender a conectar Apache Spark con una base de datos MongoDB para realizar operaciones de lectura, análisis y escritura. Este es un patrón muy común en arquitecturas de Big Data, donde MongoDB se utiliza como un sistema de almacenamiento flexible y Spark se usa para el procesamiento a gran escala.

Aprenderemos a:

  1. Configurar el entorno: Iniciar una sesión de Spark con el conector oficial de MongoDB.

  2. Conectar con MongoDB Atlas: Usaremos una base de datos en la nube para un ejemplo realista.

  3. Poblar MongoDB: Descargaremos un dataset real (Títulos de Netflix) y lo insertaremos en nuestra base de datos.

  4. Leer datos: Cargar la colección de Netflix en un DataFrame de Spark.

  5. Analizar datos: Utilizar el poder de Spark para realizar agregaciones y filtros sobre los datos.

  6. Escribir datos: Guardar los resultados de nuestro análisis de vuelta en una nueva colección en MongoDB.

# =================================================================
# Paso 1: Instalar las dependencias
# =================================================================
!pip install pyspark pymongo pandas -q
# =================================================================
# Paso 2: Iniciar la Sesión de Spark con el Conector de MongoDB
# =================================================================
from pyspark.sql import SparkSession

# Versión del conector de Spark para MongoDB
mongo_spark_connector_version = "3.0.1"
mongo_spark_connector = f"org.mongodb.spark:mongo-spark-connector_2.12:{mongo_spark_connector_version}"

spark = SparkSession.builder\
    .appName("SparkMongoDBNetflix")\
    .config("spark.jars.packages", mongo_spark_connector)\
    .getOrCreate()

print("✅ SparkSession iniciada con el conector de MongoDB.")
✅ SparkSession iniciada con el conector de MongoDB.

3. Poblando MongoDB con el Dataset de Netflix#

Ahora, vamos a descargar el dataset de Netflix, procesarlo con la librería Pandas y subirlo a nuestra base de datos en MongoDB Atlas.

import pymongo
import pandas as pd
import requests
from io import StringIO

# =================================================================
# ¡ACCIÓN REQUERIDA!
# Reemplaza la cadena de conexión con la tuya de MongoDB Atlas.
# Asegúrate de poner tu usuario y contraseña.
# =================================================================
MONGO_URI = ""

# Definimos el nombre de nuestra base de datos y colección
DB_NAME = "netflix"
COLLECTION_NAME = "titles"

# URL del dataset de Netflix en formato CSV
url = "https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2021/2021-04-20/netflix_titles.csv"

print("📥 Descargando el dataset de Netflix...")
try:
    # Descargar el contenido del archivo CSV
    response = requests.get(url)
    response.raise_for_status() # Lanza un error si la descarga falla

    # Leer el CSV en un DataFrame de Pandas
    csv_data = StringIO(response.text)
    df_pandas = pd.read_csv(csv_data)

    # Limpieza básica: Reemplazar valores nulos (NaN) para evitar errores en MongoDB
    df_pandas.fillna("", inplace=True)

    # Convertir el DataFrame a una lista de diccionarios (formato JSON)
    data_to_insert = df_pandas.to_dict('records')

    print(f"📄 Se leyeron {len(data_to_insert)} documentos del archivo.")

    # Conectar a MongoDB y poblar la colección
    print("⏳ Conectando a MongoDB Atlas y poblando la colección...")
    client = pymongo.MongoClient(MONGO_URI)
    db = client[DB_NAME]
    collection = db[COLLECTION_NAME]

    # Limpiamos la colección por si ya existían datos
    collection.delete_many({})

    # Insertamos los nuevos datos
    collection.insert_many(data_to_insert)

    print(f"✅ ¡Éxito! {len(data_to_insert)} documentos insertados en la colección '{COLLECTION_NAME}'.")
    client.close()

except Exception as e:
    print(f"❌ Ocurrió un error: {e}")
📥 Descargando el dataset de Netflix...
📄 Se leyeron 7787 documentos del archivo.
⏳ Conectando a MongoDB Atlas y poblando la colección...
✅ ¡Éxito! 7787 documentos insertados en la colección 'titles'.

4. Leer Datos de MongoDB con Spark#

Con los datos ya en MongoDB, usaremos Spark para leer la colección completa y cargarla en un DataFrame distribuido.

# Leemos los datos desde la colección de Netflix
df_netflix = spark.read.format("mongo")\
    .option("uri", MONGO_URI)\
    .option("database", DB_NAME)\
    .option("collection", COLLECTION_NAME)\
    .load()

print("📝 Esquema del DataFrame de Netflix:")
df_netflix.printSchema()

print("\n🎬 Algunos datos cargados desde la colección de Netflix:")
df_netflix.show(5)
📝 Esquema del DataFrame de Netflix:
root
 |-- _id: struct (nullable = true)
 |    |-- oid: string (nullable = true)
 |-- cast: string (nullable = true)
 |-- country: string (nullable = true)
 |-- date_added: string (nullable = true)
 |-- description: string (nullable = true)
 |-- director: string (nullable = true)
 |-- duration: string (nullable = true)
 |-- listed_in: string (nullable = true)
 |-- rating: string (nullable = true)
 |-- release_year: integer (nullable = true)
 |-- show_id: string (nullable = true)
 |-- title: string (nullable = true)
 |-- type: string (nullable = true)


🎬 Algunos datos cargados desde la colección de Netflix:
+--------------------+--------------------+-------------+-----------------+--------------------+-----------------+---------+--------------------+------+------------+-------+-----+-------+
|                 _id|                cast|      country|       date_added|         description|         director| duration|           listed_in|rating|release_year|show_id|title|   type|
+--------------------+--------------------+-------------+-----------------+--------------------+-----------------+---------+--------------------+------+------------+-------+-----+-------+
|{68ccb6974b8b31bd...|João Miguel, Bian...|       Brazil|  August 14, 2020|In a future where...|                 |4 Seasons|International TV ...| TV-MA|        2020|     s1|   3%|TV Show|
|{68ccb6974b8b31bd...|Demián Bichir, Hé...|       Mexico|December 23, 2016|After a devastati...|Jorge Michel Grau|   93 min|Dramas, Internati...| TV-MA|        2016|     s2| 7:19|  Movie|
|{68ccb6974b8b31bd...|Tedd Chan, Stella...|    Singapore|December 20, 2018|When an army recr...|     Gilbert Chan|   78 min|Horror Movies, In...|     R|        2011|     s3|23:59|  Movie|
|{68ccb6974b8b31bd...|Elijah Wood, John...|United States|November 16, 2017|In a postapocalyp...|      Shane Acker|   80 min|Action & Adventur...| PG-13|        2009|     s4|    9|  Movie|
|{68ccb6974b8b31bd...|Jim Sturgess, Kev...|United States|  January 1, 2020|A brilliant group...|   Robert Luketic|  123 min|              Dramas| PG-13|        2008|     s5|   21|  Movie|
+--------------------+--------------------+-------------+-----------------+--------------------+-----------------+---------+--------------------+------+------------+-------+-----+-------+
only showing top 5 rows

5. Analizar los Datos con Spark#

Ahora realizaremos un análisis sobre el DataFrame:

  1. Contar cuántas producciones son Películas (Movie) vs. Shows de TV (TV Show).

  2. Encontrar el Top 5 de países con la mayor cantidad de producciones en Netflix.

from pyspark.sql.functions import col, desc

# 1. Conteo por tipo (Movie vs. TV Show)
df_conteo_tipo = df_netflix.groupBy("type").count()

print("📊 Conteo de producciones por tipo:")
df_conteo_tipo.show()


# 2. Top 5 países con más producciones
# Filtramos para excluir registros donde el país no está especificado
df_conteo_pais = df_netflix.filter(col("country") != "")\
    .groupBy("country")\
    .count()\
    .orderBy(desc("count"))\
    .limit(5)

print("\n🏆 Top 5 países con más producciones en Netflix:")
df_conteo_pais.show()
📊 Conteo de producciones por tipo:
+-------+-----+
|   type|count|
+-------+-----+
|TV Show| 2410|
|  Movie| 5377|
+-------+-----+


🏆 Top 5 países con más producciones en Netflix:
+--------------+-----+
|       country|count|
+--------------+-----+
| United States| 2555|
|         India|  923|
|United Kingdom|  397|
|         Japan|  226|
|   South Korea|  183|
+--------------+-----+

6. Escribir Resultados de Vuelta a MongoDB#

Finalmente, guardaremos el resultado de nuestro análisis (el Top 5 de países) en una nueva colección en MongoDB para que pueda ser consultado por otras aplicaciones.

# Definimos el nombre de la nueva colección para los resultados
RESULTS_COLLECTION_NAME = "top_paises_productores"

# Escribimos el DataFrame de resultados en MongoDB
# El modo "overwrite" borrará la colección si ya existe y la creará de nuevo.
df_conteo_pais.write.format("mongo")\
    .option("uri", MONGO_URI)\
    .option("database", DB_NAME)\
    .option("collection", RESULTS_COLLECTION_NAME)\
    .mode("overwrite")\
    .save()

print(f"✅ Resultados guardados en la colección '{RESULTS_COLLECTION_NAME}'.")

# Verificación final usando pymongo para confirmar que los datos se escribieron
try:
    client = pymongo.MongoClient(MONGO_URI)
    db = client[DB_NAME]
    results_collection = db[RESULTS_COLLECTION_NAME]
    print("\n🧐 Verificando los datos escritos en MongoDB:")
    for doc in results_collection.find():
        print(doc)
    client.close()
except Exception as e:
    print(f"❌ Error al verificar los datos: {e}")

# Detenemos la sesión de Spark para liberar recursos
spark.stop()
✅ Resultados guardados en la colección 'top_paises_productores'.

🧐 Verificando los datos escritos en MongoDB:
{'_id': ObjectId('68ccb6b9a0f4ab6fc60db1ae'), 'country': 'United States', 'count': 2555}
{'_id': ObjectId('68ccb6b9a0f4ab6fc60db1af'), 'country': 'India', 'count': 923}
{'_id': ObjectId('68ccb6b9a0f4ab6fc60db1b0'), 'country': 'United Kingdom', 'count': 397}
{'_id': ObjectId('68ccb6b9a0f4ab6fc60db1b1'), 'country': 'Japan', 'count': 226}
{'_id': ObjectId('68ccb6b9a0f4ab6fc60db1b2'), 'country': 'South Korea', 'count': 183}