(While navigating through the site, please be sure to disable your pop-up blocker.)
Data Scientist, Spatial Omics
The University of Texas MD Anderson Cancer Center has the potential to unlock the power of data by further developing and investing in talent, team science and infrastructure to optimize multidimensional data integration, analysis and application for the benefit of patients with cancer. The Institute for Data Science in Oncology (IDSO) is a signature priority program aimed at transforming the patient experience, enhancing quality of life and accelerating scientific breakthroughs via advanced, data-driven approaches to cancer care. IDSO will enable teams to search for, learn from and apply as much information as possible gathered from every patient MD Anderson has seen or will see. By growing a data-centric culture and advancing data management and analytics maturity, we provide better, state-of-science and state-of-data-science care for patients while exploring areas of cancer research and treatment currently unknown to clinical communities.
The IDSO recruits, positions and enables best-in-class data scientists to unravel seemingly insoluble problems in cancer and work toward meaningful solutions for patients. Our aims include reducing the time between diagnostic procedures and treatment decisions, advancing drug discovery efforts and bringing new, precise medicines to bedsides sooner. The IDSO centralizes our focused institutional investment in data science, as well as enables partnerships with other world-leading organizations, to operate and enhance an unprecedented oncological “data supply chain” designed to accelerate research and treatment innovation. Comprised of the best minds in a myriad of scientific and data-driven fields, the IDSO facilitates a culture grounded in our innovative “team-data science” principles, such as shared motivation, shared learning, provenance linking insights to observations and integrated data governance.
One of the five focus areas IDSO has prioritized is Focus Area 2 - Single Cell Spatial Omics to Unveil Cancer Complexity, which is co-led by Dr. Linghua Wang, an Associated Professor in the department of Genomic Medicine at MD Anderson Cancer Center and a recognized computational biologist and leader in single-cell spatial research in cancer. Our goal is to build a world-class data science hub for single-cell and spatial omics studies to accelerate the discovery and translation of cancer research at MD Anderson and beyond.
* Benchmark computational methods and toolkits to ensure best practices and excellence in data processing and analysis.
* Develop analytical approaches as needed for new technologies and integrate them with existing platforms.
* Ensure scalability and efficiency of the pipelines to handle large and complex datasets.
* Conduct rigorous testing and validation of computational tools to ensure accuracy and reproducibility.
* Develop interactive web portals for effective data exploration and visualization.
* Work with teams to integrate data and metadata, assisting postdocs and students in analyzing and resolving issues related to data analysis as needed.
* Lead hands-on training workshops, courses, and seminars.
* Assist with manuscript and grant preparation and presentations, and present at lab meetings, departmental and institutional seminars, and appropriate scientific conferences.
Other Duties:
* Attend collaborator meetings and team working group meetings. Prioritize and manage multiple projects in a timely and resource-effective manner.
* Stay up-to-date with relevant literature, gather information systematically, and confer with the supervisor regarding new procedures.
* Experience with best practices in spatial omics data analysis, including multimodal data integration.
* Demonstrated ability to develop, optimize, and maintain bioinformatics pipelines.
* Proficiency in at least two or more programming languages, such as R and Python.
* Expertise in modern workflow languages, such as Nextflow, WDL, or CWL for scalable, reproducible bioinformatics pipelines.
* Experience with containerization technologies, including Docker, Docker Swarm, and Kubernetes, for creating reproducible and scalable environments.
* Proficiency in version control systems (e.g., Git, and GitHub) and Linux system administration.
* Familiarity with FAIR principles (Findable, Accessible, Interoperable, and Reusable) for data and tool development to ensure reproducibility and accessibility of scientific workflows.
* Strong attention to detail, with the ability to multi-task and a willingness to learn new methods and technologies.
* Excellent written and verbal communication skills.
* Ability to work effectively in a collaborative and interdisciplinary team environment (strong plus).
* Knowledge of high-performance computing (HPC) environments is a plus.
WORKING CONDITIONS
|
Frequency |
Deadlines |
-- |
|
Frequency |
Weight |
Sitting |
Constant 67-100% |
-- |
Analytical Ability |
EDUCATION:
Required: Bachelor's degree in Biomedical Engineering, Electrical Engineering, Computer Engineering, Physics, Applied Mathematics, Science, Engineering, Computer Science, Statistics, Computational Biology, or related field.
Preferred: Master's degree in Biomedical Engineering, Electrical Engineering, Computer Engineering, Physics, Applied Mathematics, Science, Engineering, Computer Science, Statistics, Computational Biology, or related field.
EXPERIENCE:
Required: Three years experience in scientific software or industry development/analysis. With Master's degree, one years experience required. With PhD, no experience required.
Preferred: With Master's degree 5 years with best practices in spatial omics data analysis, including multimodal data integration. With PhD, 3 years with best practices in spatial omics data analysis, including multimodal data integration.
It is the policy of The University of Texas MD Anderson Cancer Center to provide equal employment opportunity without regard to race, color, religion, age, national origin, sex, gender, sexual orientation, gender identity/expression, disability, protected veteran status, genetic information, or any other basis protected by institutional policy or by federal, state or local laws unless such distinction is required by law. http://www.mdanderson.org/about-us/legal-and-policy/legal-statements/eeo-affirmative-action.html