Hi,
I have opened pull request on GitHub regarding the mailing-list analysis:
https://github.com/siemens/codeface/pull/40
Here, the description as appearing on GitHub:
This pull request contains many fixes to the mailing-list analysis that
have been lying around on other branches (i.e., `mitchell_updates` and
`for-upstream`), but also bring much benefit and fix the analysis to a
great extent (called *old patches* below). Furthermore, there are
additional patches that fix some more things (called *new patches*
below).
These main fixes include the following:
- handling the case of proxy e-mails (e.g., `Adrian Prantl via llvm-dev
<llvm-dev@xxxxxxxxxxxxxx>`) [29d777d (old)],
- removal of problematic characters in author-mail strings [29d777d
(old), 1098eeb (old), ee1e7b4 (new)] and better handling of malformed
author-mail strings [e92a6dd (old), e3f58a1 (old, references #34)],
- enhancing the loading of mbox files [29ac3f0 (old), 9d5982e (old)],
- enhancement to storage of e-mails in the `mail` table of the DB and
removal of duplicate e-mails [0555675 (old), acf0e2e (old), 8c7263d
(new)].
Furthermore, this pull request includes some other patches:
- fix to really re-install GitHub packages in `packages.R` [99666d8
(new)],
- fix of indentation and whitespace in `codeface/R/ml/analysis.r`
[3e0a6f8 (new)],
- enhancement to a log message in ID service [4745e9a (new)], and
- addition of the package `screen` to the installation scripts [4f44923
(new)].
Attachment:
signature.asc
Description: OpenPGP digital signature