TC
Azure DataBricks Engineer
Tata Consultancy Services
Allen · On-site Full-time $120k – $140k/yr 1mo ago
About the role
Must Have Technical/Functional Skill
- Design, develop, and maintain cloud native data engineering solutions using Azure Databricks.
- Build and manage PySpark notebooks to process large scale structured and semi structured datasets.
- Design, create, and maintain Delta Lake tables, ensuring data reliability, ACID transactions, and schema enforcement.
- Develop scalable data workflows and pipelines using Databricks notebooks and orchestration patterns.
- Optimize performance of Spark jobs, including tuning partitions, memory usage, caching strategies, and query execution.
- Work extensively with PySpark and Spark SQL, choosing the appropriate approach based on use case and performance needs.
- Support cloud data migration initiatives, migrating data pipelines from on prem or legacy platforms to Azure Databricks.
- Integrate Databricks with upstream and downstream systems (e.g., data sources, storage layers, reporting tools).
- Ensure data pipelines are robust, reusable, and maintainable, following enterprise data engineering best practices.
- Implement error handling, logging, monitoring, and recovery strategies for production grade data pipelines.
- Collaborate with data architects, analysts, and downstream consumers to understand data requirements.
- Perform debugging and root cause analysis for data quality, performance, or pipeline failures.
- Support testing, validation, and reconciliation of data during development, migration, and production phases.
- Follow security, governance, and compliance standards applicable to cloud data platforms.
- Actively participate in Agile/Scrum delivery, owning data engineering stories from development through deployment.
- Maintain documentation for notebooks, workflows, data models, and migration approaches.
Roles & Responsibilities
- Develop and maintain data engineering solutions using Azure Databricks and PySpark.
- Create, enhance, and optimize Databricks notebooks for data ingestion, transformation, and aggregation.
- Design and manage Delta Lake tables and pipelines supporting analytics and reporting use cases.
- Support cloud data migrations, including data validation and performance benchmarking.
- Optimize Spark jobs for performance, scalability, and cost efficiency.
- Collaborate with platform, DevOps, and data governance teams to ensure environment stability.
- Perform data pipeline testing and validation, ensuring correctness and completeness.
- Troubleshoot and resolve issues related to Spark jobs, Delta tables, and workflow execution.
- Participate in code reviews and enforce data engineering best practices.
- Support production deployments and post deployment stabilization.
- Provide inputs to data architecture and platform improvement initiatives.
- Mentor junior data engineers when required.
Salary Range
$120,000-$140,000 Per year
TCS Employee Benefits Summary
- Discretionary Annual Incentive.
- Comprehensive Medical Coverage: Medical & Health, Dental & Vision, Disability Planning & Insurance, Pet Insurance Plans.
- Family Support: Maternal & Parental Leaves.
- Insurance Options: Auto & Home Insurance, Identity Theft Protection.
- Convenience & Professional Growth: Commuter Benefits & Certification & amp; Training Reimbursement.
- Time Off: Vacation, Time Off, Sick Leave & Holidays.
- Legal & Financial Assistance: Legal Assistance, 401K Plan, Performance Bonus, College Fund, Student Loan Refinancing.
Qualifications
BACHELOR OF COMPUTER SCIENCE
Skills
Azure DatabricksDelta LakePySparkSpark SQL
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free