On Wed, Jan 14, 2026 at 1:52 PM Bertrand Drouvot
<bertranddrouvot.pg@gmail.com> wrote:
> postgres=# SELECT a1.author, a2.author,
> similarity(a1.author, a2.author) as similarity_score
> FROM top_authors2025 a1
> JOIN top_authors2025 a2 ON a1.author < a2.author
> WHERE similarity(a1.author, a2.author) > 0.6
> ORDER BY similarity_score DESC;
> author | author | similarity_score
> ----------------------+-----------------------+------------------
> Hou Zhijie [*] | Zhijie Hou [*] | 1
> Maksim Melnikov [*] | Melnikov Maksim [*] | 1
> Andrei Lepikhov [*] | Andrey Lepikhov [*] | 0.7777778
> Mihail Nikalayeu [*] | Mikhail Nikalayeu [*] | 0.75
> Lukas Fitti [*] | Lukas Fittl [*] | 0.71428573
> Dmitry Koval [*] | Dmitry Kovalenko [*] | 0.6666667
> (6 rows)
I have made these corrections:
update commits2025 set author = 'Hou Zhijie' where author = 'Zhijie Hou';
update commits2025 set author = 'Maksim Melnikov' where author =
'Melnikov Maksim';
update commits2025 set author = 'Andrei Lepikhov' where author =
'Andrey Lepikhov';
update commits2025 set author = 'Lukas Fittl' where author = 'Lukas Fitti';
Please let me know if you see anything else.
Thanks,
--
Robert Haas
EDB: http://www.enterprisedb.com