Java Software Developer - Data Lake Engineering

Location(s) SE-STK-Stockholm
Job ID
Schedule Type
Full Time
Vice President/Executive Director, Associate, Analyst
Software Engineer
Business Unit
Data Lake
Employment Type



Goldman Sachs is a leading global investment banking, securities and investment management firm that provides a wide range of services worldwide to a substantial and diversified client base that includes corporations, financial institutions, governments and high net-worth individuals.

Founded in 1869, it is one of the oldest and largest investment banking firms. The firm is headquartered in New York and maintains offices in London, Bangalore, Frankfurt, Tokyo, Hong Kong and other major financial centres around the world.

We are committed to growing our distinctive Culture and holding to our core values which always place our client's interests first. These values are reflected in our Business Principles, which emphasise integrity, commitment to excellence, innovation and teamwork.


We are building out our Stockholm Data Engineering team, this is a unique opportunity to be part of starting up a EMEA data engineering centre of excellence in a global Tier 1 Investment Bank.


The GS Data Lake is an industry leading cutting edge data lake implantation that is leveraged by majority of GS business units. Our team of engineers build solutions to the most complex
problems leveraging distributed technology stacks as Hadoop, Apache Spark, Kafka, Elastic Search, Flink in combination of flora of warehouse technologies as Sybase IQ, IBM DB2, MemSQL, Snowflake, S3 and more to comm.

Technology teams across the Firm are clients, participating in providing and consuming data to & from the lake. Developers on the team create and manage the software that manages the data in the lake, ensure entitlements are enforced appropriately, data is milestoned, and is available for query on multiple target warehouse platforms. This platform enables structuring, management, integration, control, discovery, usage, and governance of our Data Assets.

Data Lake Engineering is comprised of 3 teams. Our recruiting process is aligned to find the right fit for you within these three areas of focus and we do want you to learn about each of them in order to align your interests with available roles. If you believe that you have a targeted interest, please do say so.

Data Lake Core Development is an Engineering Team that focuses on the continues development on the lake to optimize it, add features and continue to maintain its cutting edge status. Core skills: Top Tier Java developer, experience from large distributed systems, working experience from one or more of the following frameworks: Hadoop, Spark, Flink, Kafkha, IQ, DB2, S3, Snowflake.


  • Develop cutting edge technology stack using latest frameworks available
  • Solve hard complex distributed problems
  • Drive hybrid cloud integration
  • Drive integration of the latest and greatest warehouse technologies

Data Lake Reliability Engineering team is a data driven engineering team who’s focus Is to engineer solutions around stability, scalability and system optimization for the Lake. A key part of the role is to build infrastructure to measure and predict everything from client behaviors to future capacity needs. Core skills are: Python or Java, applied statistics and fundamental data science / ML principles/SRE Principals.

  • Develop cutting edge technology stack to measure big data platform and enable a metric
    driven platform management strategy
  • Continues metric driven work to improve, optimize and harden our datalake for continues
    growth and usage
  • Transform reactive process to proactive process through engineering
  • Apply data science practices to understand platform and user behaviours
  • Apply machine learning practices to automate and optimize
  • Problem solving, creativity and ownership will empower you to deliver data products endto-end.
  • Driven by business impact and customer centricity, you will strive for excellence through incremental improvements.

Data Lake Solutions architect engage the firm’s Engineering teams (our clients) to understand their big data needs and help them architect technical solutions that leverage the features and functionality of the Data Lake, the Firm’s strategic repository for enterprise data. They are functionally knowledgeable about the Lake’s current and planned features as well as technically adept at identifying and resolving simple issues in user’s configurations and code. They also build tools and scripts that enable the team to more efficiently serve our clients. Core skills: Apache Hadoop, Spark and Flink as well as development experience with Java and Python.

  • Individuals who can provide strategic thinking and recommend technical solutions to engineering teams across the firm engaged in big data initiatives
  • You will work closely with the development team to understand the Data Lake offering
    and be a technical subject matter expert which you will apply to guide our clients to effectively use the Data Lake platform
  • You will help to remediate user’s configuration, stability and performance issues through your hands-on experience with the Data Lake


Data Lake Developer/Reliability Engineer/Solution Architect


Basic Qualifications All Roles

  • 3+ years of experience with Java or Python with a good grasp of development, Object Oriented Analysis and Design and testing best practices.
  • Good understanding of distributed systems
  • Working knowledge of scripting languages, Linux, Networking protocols, security and file systems
  • Strong technical skills, analytical mindset, self-motivated, independent, creative, can solve interesting and sometimes difficult technical problems under time pressure and resource constraints
  • Commercially focused; seeks to understand the requirements and how they will benefit our clients, stakeholders, and business
  • Experience with all stages in the development lifecycle: inception, analysis, design, review, testing, and deployment
  • Good sense of user interaction and usability design to provide an intuitive, seamless end user experience.
  • Experience building and sustaining long-term relationships with clients and colleagues in a diverse global organization
  • Judgment to prioritize and escalate issues in order to influence objectives and outcomes.
  • Excellent written and verbal communication skills, including experience working directly with both technical and non-technical stakeholders
Data Lake Core Development Qualifications
  • Experience with developing Workflow, Spark, MapReduce, Flink, Elasticsearch, and Sybase IQ.

Data Lake Reliability Engineering Qualifications

  • Experience from driving change through metrics
  • Experience developing in Jupyter, Django, Flask, Angular
  • Experience driving change though applied statistics, Machine Learning / Data Science
  • Experience in fullstack n-dimensional incident management and root cause investigation
  • Familiarity of SRE principles
  • Preferred experience of one of the following: Hadoop, Spark, MapReduce, Flink, Elasticsearch, Sybase IQ, DB2, .


Data Lake Client Solutions Engineering Qualifications

  • 3+ years of software development and/or business management experience, with particular focus on business data management in a client/user facing role
  • Experience with User Interface development using JavaScript, preferably with AngularJS
  • Comfortable handling a large volume of emails and phone calls
  • A healthy obsession with customer satisfaction and the customer experience, and a willingness to be its internal advocate
  • The ability to self-direct as circumstances change and priorities shift, based on organizational needs, without constant & overt micromanagement
  • Ability to stay commercially focused and to always push for quantifiable commercial impact
  • Ability to collaborate effectively across global teams and communicate complex ideas in a simple manner
  • Extensive knowledge and proven experience applying domain driven design to build complex business applications or processes



At Goldman Sachs, we commit our people, capital and ideas to help our clients, shareholders and the communities we serve to grow. Founded in 1869, we are a leading global investment banking, securities and investment management firm. Headquartered in New York, we maintain offices around the world.

We believe who you are makes you better at what you do. We're committed to fostering and advancing diversity and inclusion in our own workplace and beyond by ensuring every individual within our firm has a number of opportunities to grow professionally and personally, from our training and development opportunities and firmwide networks to benefits, wellness and personal finance offerings and mindfulness programs. Learn more about our culture, benefits, and people at

We’re committed to finding reasonable accommodations for candidates with special needs or disabilities during our recruiting process. Learn more:

© The Goldman Sachs Group, Inc., 2020. All rights reserved.
Goldman Sachs is an equal employment/affirmative action employer Female/Minority/Disability/Veteran/Sexual Orientation/Gender Identity