Project Management

Home / Discussions / Project Management Central /

How do you navigate unexpected data challenges in your projects?

Artificial Intelligence

Claudia Alcelay

PMI Team Member

Learning & Innovation Research Manager| Project Management Institute (PMI) Spain

Data quality and quantity is particularly important as we think about leveraging AI on projects. Considerations include the diversity and comprehensiveness of the data that is available to us.

Have you ever encountered unexpected challenges or pitfalls while using data in your projects? How did you navigate the situation and find a resolution?

Posted: Jan 11, 2024 2:20 PM

Sort By:

< 1 ... 7 8 9 10 11 12 13 14 15 16 17 ... 42 >

Anonymous

For the growth of artificial intelligence, information sharing is a priority. What is the boundary between data that is private and data that is public? This is a question I constantly ask myself.

Posted: Apr 11, 2024 12:56 PM

Anonymous

Data quality and quantity are critical when working with projects. We've encountered challenges like incomplete datasets or inconsistencies in data formats. In such situations, I've worked closely with our data team to identify the gaps and standardize the data. Concurrently we supplemented existing data with external sources. Doing this has helped us maintain the integrity of data achieve more reliable results.

Posted: Apr 12, 2024 1:36 PM

Louis Charlie YAKAN ABIDJAN, ALBERTA, Côte d'Ivoire

When faced with a critical incident or anomaly during a project, the primary responsibility falls to the Project Manager to make an important decision, sometimes immediately. The reputation of the company, the risks involved, the quality of services or data, the proposal of an effective and efficient solution, security, compliance... are prerequisites in decision-making. What we then recommend is adaptability and flexibility to change because in today's world new technologies require adaptation.

Posted: Apr 13, 2024 1:02 AM

KEREM ILHAMI BUGDAYCIOGLU Architect (M.Arch),Project Manager,BIM Consultant| Archimco (Personal Establishment ) Istanbul, Türkiye

Which Data Quality Management Software is appropriate currently?

Posted: Apr 13, 2024 12:55 PM

Sanghamitra Krishnamoorthy Senior Project Manager| Customer Value Partner Corporation Dunn Loring, Va, United States

Feb 01, 2024 4:13 PM

Replying to Verónica Elizabeth Pozo Ruiz

...

The main pitfalls that we can encounter when working with large amounts of data are inaccuracies, incoherency, and duplicated or outdated data. It's appropriate to use Data Quality Management Software to identify and correct these issues, and also monitor and synchronize data across the company, keeping database quality perfect.

Its always a struggle in large organizations to identify the true owner of data, and prevent replication / creation of outdated data. There are multiple vendors offering data management software - has anyone had any experience using or implementing Illumio ?

Posted: Apr 13, 2024 1:20 PM

Anonymous

I have yet experienced data pitfalls on my projects, I have best insight through this course on how to monitor and control data.
Thank you

Posted: Apr 13, 2024 1:41 PM

Ocholi Ausa Professional Graduate Programme| Smith's School of Business Queen's University Toronto Toronto, Canada

By considering a methodical approach to preparing it using standard best practices and frameworks that ensures that the data is cleaned, preprocessed, explored and certified fit for use before analysing or using it for its intended purpose. The quality and integrity of data impacts on its output and by implication decisions made off it. Analyzing, preprocessing and engaging in exploratory data analysis is an inevitable ritual for any practitioner or organisation who depends on it to deliver on a task.

Posted: Apr 13, 2024 6:24 PM

Daniela Zuppichin Director de Proyectos, Agile Trainer| SiPro360 Project Consulting Ciudad Autonoma de Buenos Aires, Argentina

The quality and quantity of data are absolutely crucial when working with artificial intelligence. Machine learning models are trained on large datasets, so the diversity and comprehensiveness of those datasets will greatly determine the performance and capabilities of the resulting model.
In my own projects, we encounter several challenges related to data:
Data bias: Often, available datasets may have internal biases, either from the way they were collected or from existing biases in the real world. This can lead to models training on those biases, resulting in unfair or inaccurate outcomes.
Noise in the data: Real-world data is often noisy and incomplete. Things like human errors, outliers, missing data, etc., can excessively hinder effective model training.
Data availability: In some domains, there simply isn't enough publicly available data to train robust models. Gathering new datasets can be a very costly and time-consuming process.
To mitigate these challenges, some strategies we use include:
Thorough data preprocessing: We invest a lot of effort in cleaning, filtering, and properly formatting the data before training. This helps handle noise and missing values.
Data augmentation techniques: When data is scarce, we apply techniques such as data translation, random cropping, etc., to generate additional synthetic data.
Using multiple data sources: We combine different public and private datasets to mitigate biases and increase diversity.
Bias evaluation: We conduct specific tests to assess and mitigate potential biases in the trained models.
In summary, working with high-quality data in large quantities remains one of the biggest challenges in AI. It requires a careful and proactive approach to ensure that models are fair, accurate, and robust. Unfortunately, many companies still do not consider investing proactively in this.

Regards from the end of the world!
DZ.-

Posted: Apr 14, 2024 11:07 AM

Mukesh Sengar Project Manager| XL IMPE INC, ATIKA TECHNOLOGIES Magnolia, Tx, United States

Addressing unexpected data challenges requires ongoing enhancements in our data governance policies and controls throughout the data lifecycle. This process begins with data collection from various sources, and based on data source discovery, realizing the data profiling, which involves collaboration between business analysts and data engineering experts. Furthermore, coordinating data consolidation & data agregation with the cordination of business stakeholders is crucial. Maintaining data integrity and implementing data security measures are essential to producing high-quality and consistent data aligned with business needs and requirnment, which are key to driving project success. Additionally, implementing a checklist, conducting continuous audits and getting stakeholders feedbacks to review data consistency and quality alignement with business needs, and continouse improvements efforts . There is no one-time solution; persistent and contionouse improvement is essential to nevigate unexpected data challenges.

Posted: Apr 14, 2024 12:43 PM

Emmanuel Ogunyinka Capitol Heights, MD, United States

I have encountered a few situations where data-related issues arose, and here's how I navigated and resolved them:

1. low-quality data: the data we had access to was plagued by errors, inconsistencies, or low-quality entries. We implemented data cleaning and preprocessing steps, including outlier detection, deduplication, and standardization, to improve the overall quality of the data before feeding it into our AI models.

2. Lack of domain-specific data: we faced a shortage of relevant, domain-specific data to train our AI models effectively. We collaborated with subject matter experts to manually curate and annotate a smaller but high-quality dataset, which we then used for transfer learning or few-shot learning techniques.

Posted: Apr 14, 2024 1:37 PM (Updated by author: Apr 14, 2024 1:38 PM)

< 1 ... 7 8 9 10 11 12 13 14 15 16 17 ... 42 >

Please login or join to reply

How do you navigate unexpected data challenges in your projects?

Sponsors

Vendor Events

Guessing is not a strategy: How to build decision velocity with AI and real-time data

Newsletters