Valence is seeking an exceptional Data Scientist to join our growing team within our R&D department, Valence Labs.
In this role, you will leverage your analytical and technical skills to extract powerful insights from complex datasets.
You will build models that enhance our understanding of data networks and user behavior to drive product and business strategy.
As our Data Scientist, you will work closely with stakeholders across the company to identify opportunities where advanced analytics can solve problems and create value.
You will synthesize conclusions from large-scale data analysis and communicate results to both technical and non-technical audiences. Your models will provide the intelligence guiding significant product and business decisions.
You will tackle highly ambiguous problems with creative thinking and rigorous methodology.
You will employ techniques like statistical analysis, machine learning, algorithm development, and data visualization to drive insights.
You will be passionate about communicating analytical narratives with clarity and purpose and adept at remote work environments and collaboration tools.
Responsibilities
1. Query Optimization: Optimize database queries, data retrieval processes, and data manipulation operations to improve performance and speed. Identify and eliminate bottlenecks, reduce latency, and enhance the efficiency of data access and retrieval through techniques such as query rewriting, indexing, caching, and parallel processing.
2. Value Attribution: Design and implement methodologies to attribute value accurately to different data sources, variables, or business activities. Develop statistical models and machine learning algorithms to measure and quantify the impact of various factors on business outcomes. Collaborate with business stakeholders to understand their value attribution requirements and provide data-driven insights and recommendations.
3. Machine Learning and Statistical Modeling: Apply statistical analysis and machine learning techniques to gain insights from large datasets, including feature selection, regression, classification, and clustering. Develop predictive models and algorithms to identify patterns, trends, and anomalies in data to enhance query performance and optimize data retrieval.
4. Data Infrastructure Improvement: Collaborate with the data engineering team to design, develop, and optimize data infrastructure components, including data pipelines, storage systems, and data integration processes. Provide guidance and expertise in leveraging modern data processing technologies to improve the efficiency of data retrieval and analysis.
5. Research and Innovation: Stay up to date with the latest advancements in cryptography, query optimization techniques, and data science methodologies. Conduct research and experiments to explore novel approaches and tools that can enhance data security and accelerate query performance.
Qualifications
– The most important is to have as many of the following as possible:
1. High-performance computing research
2. LLM experience
3. Adtech experience
4. Research experience/advanced degree
– Bachelor’s or Master’s degree in Computer Science, Data Science, Statistics, or a related field. A Ph.D. is a plus.
– Solid understanding of cryptography principles, protocols, and algorithms, such as symmetric and asymmetric encryption, hash functions, digital signatures, and secure key exchange.
– Proficiency in query optimization techniques and experience with optimizing database performance using tools like indexes, query rewriting, caching, and parallel processing.
– Strong programming skills in languages like Python, R, or SQL, with experience in data manipulation, analysis, and visualization libraries (e.g., pandas, NumPy, matplotlib).
– Experience with machine learning techniques, statistical modeling, and data mining algorithms.
– Familiarity with modern data processing frameworks and tools such as Hadoop, Spark, or distributed databases.
– Excellent problem-solving and analytical thinking skills, with the ability to quickly understand and tackle complex data-related challenges.
– Strong communication and collaboration skills, with the ability to work effectively in a team environment and present complex concepts to both technical and non-technical stakeholders.
About Valence
Valence supports open data networks by providing infrastructure to facilitate valuable data exchange between partners. Our solutions empower the insights that are transforming business, research, and society.
About the TeamOur research hub, Valence Labs, comprises dedicated data enthusiasts motivated by data algorithms and machine learning to unlock discoveries with real world impacts. We constantly expand our thinking by learning new methods and technologies. We hold ourselves to high standards of rigor and quality in our work. Collaboration is key – we teach each other, review each other’s work, and push the boundaries of what’s possible together. We’re united by curiosity and a shared belief in data’s power to illuminate.