
Data Scientist (Vector DB Engineer – Data Scientist
- Bangalore, Karnataka
- Permanent
- Full-time
- Design, implement, and manage vector databases to support large-scale data storage and retrieval, ensuring low latency and high availability.
- Develop efficient data models that facilitate fast vector operations such as similarity search, nearest neighbor search, and other vector-based queries.
- Optimize database performance through indexing, partitioning, sharding, and other techniques to handle large-scale datasets.
- Integrate vector databases with existing systems and applications, ensuring seamless data flow and accessibility.
- Design and implement solutions that scale with growing data volumes, ensuring the database infrastructure can handle increased load and complexity.
- Implement security best practices to protect data at rest and in transit, including encryption, access controls, and audit logging.
- Monitor database performance and troubleshoot issues as they arise, ensuring system reliability and availability.
- Work closely with data scientists, machine learning engineers, and software developers to understand their needs and provide database solutions that meet their requirements.
- Maintain comprehensive documentation for database schemas, configurations, and procedures to support operational excellence and knowledge sharing.
- Deep understanding and hands-on experience with vector databases, including their architecture, query languages, and optimization techniques.
- Strong programming skills in languages such as Python, C++, or Java, with experience in developing and optimizing database operations.
- Solid understanding of data structures, algorithms, and computational geometry, particularly related to vector search and similarity measures
- Experience with cloud platforms (e.g., AWS, GCP, Azure) and managed database services.
- Understanding of machine learning concepts, particularly those related to embedding vectors and similarity searches.
- Strong problem-solving skills with a focus on performance optimization and scalability.
- Excellent communication skills, with the ability to articulate complex technical concepts to non-technical stakeholders.
- A 5-year full-time education is required.
- This position requires candidate to work a 5-day -a -week schedule in the office
- Shift Timing- 01:00PM -10:00PM IST
- Knowledge of general accounting practices, Passion for Financial Reporting, Ability to learn and adapt quickly and a strong positive attitude.
- Maintaining stable performance under demanding business needs and support to the Business to the urgency.
Level Working Knowledge:
- Explains the basic decision process associated with specific statistics.
- Works with basic statistical functions on a spreadsheet or a calculator.
- Explains reasons for common statistical errors, misinterpretations, and misrepresentations.
- Describes characteristics of sample size, normal distributions, and standard deviation.
- Generates and interprets basic statistical data.
Level Extensive Experience:
- Evaluates and makes contributions to best practices.
- Processes large quantities of detailed information with high levels of accuracy.
- Productively balances speed and accuracy.
- Employs techniques for motivating personnel to meet or exceed accuracy goals.
- Implements a variety of cross-checking approaches and mechanisms.
- Demonstrates expertise in quality assurance tools, techniques, and standards.
Level Working Knowledge:
- Approaches a situation or problem by defining the problem or issue and determining its significance.
- Makes a systematic comparison of two or more alternative solutions.
- Uses flow charts, Pareto charts, fish diagrams, etc. to disclose meaningful data patterns.
- Identifies the major forces, events and people impacting and impacted by the situation at hand.
- Uses logic and intuition to make inferences about the meaning of the data and arrive at conclusions.
Level Working Knowledge:
- Completes specific tasks and initiatives utilizing machine learning technologies, such as search engine optimization.
- Utilizes specific tools and techniques to process descriptive and inferential statistics.
- Applies specific computing languages and tools in machine learning, such as R and Python.
- Explores to use machine learning in one own areas to make business improvements.
- Conducts data mining and cleaning initiatives.
Level Working Knowledge:
- Participates in the implementation and support of specialized programming languages.
- Conducts basic reviews on writing a specific programming language within a specific platform.
- Assists with the design and development of specialized programming languages.
- Follows an organization's standards, policies and guidelines for structured programming specifications.
- Diagnoses and reports minor or routine programming language problems.
Level Working Knowledge:
- Defines, creates and tests simple queries by using associated command language in a specific environment.
- Applies appropriate query tools used to connect to the data warehouse.
- Obtains and analyzes query access path information and query results.
- Employs tested query statements to retrieve, insert, update and delete information.
- Works with advanced features and functions including sorting, filtering and making simple calculations.
Level Working Knowledge:
- Follows policies, practices and standards for determining functional and informational requirements.
- Confirms deliverables associated with requirements analysis.
- Communicates with customers and users to elicit and gather client requirements.
- Participates in the preparation of detailed documentation and requirements.
- Utilizes specific organizational methods, tools and techniques for requirements analysis.
- Work Life Harmony
- Earned and medical leave.
- Relocation assistance
- Personal and professional development through Caterpillar ‘s employee resource groups across the globe
- Career developments opportunities with global prospects
- Medical coverage -Medical, life and personal accident coverage
- Employee mental wellness assistance program
- Employee investment plan
- Pay for performance -Annual incentive Bonus plan.