Release notes

Version 0.15

  • Remove deprecated recordlinkage classes (#173)
  • Bump min Python version to 3.6, ideally 3.8+ (#171)
  • Bump min pandas version to >=1
  • Resolve deprecation warnings for numpy and pandas
  • Happy lint, sort imports, format code with yapf
  • Remove unnecessary np.sort in SNI algorithm (#141)
  • Fix bug for cosine and qgram string comparisons with threshold (#135)
  • Fix several typos in docs (#151)(#152)(#153)(#154)(#163)(#164)
  • Fix random indexer (#158)
  • Fix various deprecation warnings and broken docs build (#170)
  • Fix broken docs build due to pandas depr warnings (#169)
  • Fix broken build and removed warning messages (#168)
  • Update narrative
  • Replace Travis by Github Actions (#132)
  • Fix broken test NotFittedError
  • Fix bug in low memory random sampling and add more tests (#130)
  • Add extras_require to setup.py for deps management
  • Add banner to README and update title
  • Add Binder and Colab buttons at tutorials (#174)

Special thanks to Tomasz Waleń @twalen and other contributors for their work on this release.

Version 0.14

  • Drop Python 2.7 and Python 3.4 support. (#91)
  • Upgrade minimal pandas version to 0.23.
  • Simplify the use of all cpus in parallel mode. (#102)
  • Store large example datasets in user home folder or use environment variable. Before, example datasets were stored in the package. (see issue #42) (#92)
  • Add support to write and read annotation files for recordlinkage ANNOTATOR. See the docs and https://github.com/J535D165/recordlinkage-annotator for more information.
  • Replace .labels by .codes for pandas.MultiIndex objects for newer versions of pandas (>0.24). (#103)
  • Fix totals for pandas.MultiIndex input on confusion matrix and accuracy metrics. (see issue #84) (#109)
  • Initialize Compare with (a list of) features (Bug). (#124)
  • Various updates in relation to deprecation warnings in third-party libraries such as sklearn, pandas and networkx.