Learning & Innovation Research Manager| Project Management Institute (PMI)Spain
Data quality and quantity is particularly important as we think about leveraging AI on projects. Considerations include the diversity and comprehensiveness of the data that is available to us.
Have you ever encountered unexpected challenges or pitfalls while using data in your projects? How did you navigate the situation and find a resolution? Saving Changes...
For the growth of artificial intelligence, information sharing is a priority. What is the boundary between data that is private and data that is public? This is a question I constantly ask myself. Saving Changes...
Anonymous
Data quality and quantity are critical when working with projects. We've encountered challenges like incomplete datasets or inconsistencies in data formats. In such situations, I've worked closely with our data team to identify the gaps and standardize the data. Concurrently we supplemented existing data with external sources. Doing this has helped us maintain the integrity of data achieve more reliable results. Saving Changes...
When faced with a critical incident or anomaly during a project, the primary responsibility falls to the Project Manager to make an important decision, sometimes immediately. The reputation of the company, the risks involved, the quality of services or data, the proposal of an effective and efficient solution, security, compliance... are prerequisites in decision-making. What we then recommend is adaptability and flexibility to change because in today's world new technologies require adaptation. Saving Changes...
Which Data Quality Management Software is appropriate currently? Saving Changes...
Sanghamitra KrishnamoorthySenior Project Manager| Customer Value Partner CorporationDunn Loring, Va, United States
Feb 01, 2024 4:13 PM
Replying to Verónica Elizabeth Pozo Ruiz
...
The main pitfalls that we can encounter when working with large amounts of data are inaccuracies, incoherency, and duplicated or outdated data. It's appropriate to use Data Quality Management Software to identify and correct these issues, and also monitor and synchronize data across the company, keeping database quality perfect.
Its always a struggle in large organizations to identify the true owner of data, and prevent replication / creation of outdated data. There are multiple vendors offering data management software - has anyone had any experience using or implementing Illumio ? Saving Changes...
Anonymous
I have yet experienced data pitfalls on my projects, I have best insight through this course on how to monitor and control data.
Thank you Saving Changes...
Ocholi AusaProfessional Graduate Programme| Smith's School of Business Queen's University TorontoToronto, Canada
By considering a methodical approach to preparing it using standard best practices and frameworks that ensures that the data is cleaned, preprocessed, explored and certified fit for use before analysing or using it for its intended purpose. The quality and integrity of data impacts on its output and by implication decisions made off it. Analyzing, preprocessing and engaging in exploratory data analysis is an inevitable ritual for any practitioner or organisation who depends on it to deliver on a task. Saving Changes...
Daniela ZuppichinDirector de Proyectos, Agile Trainer| SiPro360 Project ConsultingCiudad Autonoma de Buenos Aires, Argentina
The quality and quantity of data are absolutely crucial when working with artificial intelligence. Machine learning models are trained on large datasets, so the diversity and comprehensiveness of those datasets will greatly determine the performance and capabilities of the resulting model.
In my own projects, we encounter several challenges related to data:
Data bias: Often, available datasets may have internal biases, either from the way they were collected or from existing biases in the real world. This can lead to models training on those biases, resulting in unfair or inaccurate outcomes.
Noise in the data: Real-world data is often noisy and incomplete. Things like human errors, outliers, missing data, etc., can excessively hinder effective model training.
Data availability: In some domains, there simply isn't enough publicly available data to train robust models. Gathering new datasets can be a very costly and time-consuming process.
To mitigate these challenges, some strategies we use include:
Thorough data preprocessing: We invest a lot of effort in cleaning, filtering, and properly formatting the data before training. This helps handle noise and missing values.
Data augmentation techniques: When data is scarce, we apply techniques such as data translation, random cropping, etc., to generate additional synthetic data.
Using multiple data sources: We combine different public and private datasets to mitigate biases and increase diversity.
Bias evaluation: We conduct specific tests to assess and mitigate potential biases in the trained models.
In summary, working with high-quality data in large quantities remains one of the biggest challenges in AI. It requires a careful and proactive approach to ensure that models are fair, accurate, and robust. Unfortunately, many companies still do not consider investing proactively in this.
Regards from the end of the world!
DZ.-
Saving Changes...
Mukesh SengarProject Manager| XL IMPE INC, ATIKA TECHNOLOGIESMagnolia, Tx, United States
Addressing unexpected data challenges requires ongoing enhancements in our data governance policies and controls throughout the data lifecycle. This process begins with data collection from various sources, and based on data source discovery, realizing the data profiling, which involves collaboration between business analysts and data engineering experts. Furthermore, coordinating data consolidation & data agregation with the cordination of business stakeholders is crucial. Maintaining data integrity and implementing data security measures are essential to producing high-quality and consistent data aligned with business needs and requirnment, which are key to driving project success. Additionally, implementing a checklist, conducting continuous audits and getting stakeholders feedbacks to review data consistency and quality alignement with business needs, and continouse improvements efforts . There is no one-time solution; persistent and contionouse improvement is essential to nevigate unexpected data challenges. Saving Changes...
I have encountered a few situations where data-related issues arose, and here's how I navigated and resolved them:
1. low-quality data: the data we had access to was plagued by errors, inconsistencies, or low-quality entries. We implemented data cleaning and preprocessing steps, including outlier detection, deduplication, and standardization, to improve the overall quality of the data before feeding it into our AI models.
2. Lack of domain-specific data: we faced a shortage of relevant, domain-specific data to train our AI models effectively. We collaborated with subject matter experts to manually curate and annotate a smaller but high-quality dataset, which we then used for transfer learning or few-shot learning techniques. Saving Changes...