Category: RefactorEd.ai

Record Linkage in a Data Lake

Enterprises typically have many large data sets that sit in various enterprise systems, in legacy systems, and/or dumped into big data lakes. With data being generated exponentially from numerous sources and continuously stored in inexpensive, unstructured big data environments, “Record Linkage (RL)” is a huge challenge that all enterprises face when …

 

Road to datascience hackathon @ UTD

The Colaberry learning team, in partnership with the data science club of the University of Texas, Dallas, organized a data science hackathon on October 28th, 2017. It was a whirlwind tour that put our data science learning platform, https://refactored.ai, which we are cooking up in our labs, to an interesting test in an uncontrolled environment. Here is a run …