Page MenuHomePhabricator

[epic] Dedupe CiviCRM
Closed, ResolvedPublic

Description

Deduping Civi will not only clean up our database, but include new tools to prevent dupes from being added to the system moving forward. This will always rely on some degree of manual intervention, but we'll automate what we can.

Phase 1

  • Scan for potential duplicates in a background job, tagging similar pairs and annotating with comparison results.

Phase 2

  • Display potential duplicates in a GUI table. These will be broken out (qualitatively) by automated match confidence, and split into a batched workflow for humans.
  • Allow admins to mark suspected duplicate pairs as a confirmed match.
  • Gather feedback and determine whether we can skip manual review of the high-confidence match categories.

Phase 3

  • Perform actual merging. This will require some work to make merging a reversible operation.

At the writing of this, there is an imminent phabricator upgrade which will change the story point field. The previous value was "supersized". It is now 0 and will need a new number at a later date.

Related Objects

StatusSubtypeAssignedTask
ResolvedEileenmcnaughton
ResolvedNone
ResolvedNone
ResolvedEileenmcnaughton
OpenNone
Resolvedawight
ResolvedEileenmcnaughton
ResolvedEileenmcnaughton
ResolvedEileenmcnaughton
ResolvedJgreen
ResolvedEileenmcnaughton
ResolvedEileenmcnaughton
ResolvedEileenmcnaughton
ResolvedEileenmcnaughton
ResolvedEileenmcnaughton
ResolvedEileenmcnaughton
ResolvedEileenmcnaughton
ResolvedEileenmcnaughton
ResolvedEileenmcnaughton
ResolvedEileenmcnaughton
ResolvedEileenmcnaughton
ResolvedEileenmcnaughton
ResolvedEileenmcnaughton
ResolvedEileenmcnaughton
ResolvedJgreen
InvalidJgreen
ResolvedEileenmcnaughton

Event Timeline

atgo raised the priority of this task from to Needs Triage.
atgo updated the task description. (Show Details)
atgo added a project: Fundraising-Backlog.
atgo changed Security from none to None.
atgo subscribed.
atgo triaged this task as High priority.Jan 9 2015, 7:59 PM
atgo lowered the priority of this task from High to Medium.Jan 26 2015, 10:59 PM
nshahquinn-wmf renamed this task from Dedupe Civi to Dedupe CiviCRM.May 19 2015, 4:10 PM
atgo renamed this task from Dedupe CiviCRM to [epic] Dedupe CiviCRM.Jun 11 2015, 10:53 PM

Noting a symptom of this problem: T120214

Eileenmcnaughton claimed this task.
Eileenmcnaughton subscribed.

This was done to the point where it became business-as-usual