Discussion:
Quantity and addresses of antique Usenet peers that carry full text feeds?
(too old to reply)
SugarBug
2024-03-28 19:11:43 UTC
Permalink
I wonder how many peers are carrying and preserving fulltext historical and curent Usenet feeds.

Will anyone give a ballpark number?

Which peers have articles remotest in antiquity?

Even guesstimates might be useful.
--
***@sugar.bug | sybershock.com | sci.crypt
Ivo Gandolfo
2024-03-28 20:52:15 UTC
Permalink
Post by SugarBug
I wonder how many peers are carrying and preserving fulltext historical and curent Usenet feeds.
Will anyone give a ballpark number?
Which peers have articles remotest in antiquity?
Even guesstimates might be useful.
Actually I have my full server back to 2000. Actually i'm working to add
other back from archive.org.


Sincerely
--
Ivo Gandolfo
Grant Taylor
2024-03-29 02:00:18 UTC
Permalink
Post by SugarBug
I wonder how many peers are carrying and preserving fulltext historical
and curent Usenet feeds.
Considering that all of the news servers that I've looked at have a
/default/ configuration to expire articles, you're actually asking for a
special configuration.

After Google killed Dejanews, (years ago) I would wonder if some of the
big binary friendly news service providers might happen to have a VERY
long text newsgroup archive simply based on size compared to binaries.

Beyond that, I'd start inquiring if anyone like the Library of Congress
has an archive. But I'd be somewhat surprised if they did. Or if they
did, I suspect that getting access to it in any capacity would be onerous.
--
Grant. . . .
John Levine
2024-03-29 02:15:29 UTC
Permalink
Post by Grant Taylor
After Google killed Dejanews, (years ago) I would wonder if some of the
big binary friendly news service providers might happen to have a VERY
long text newsgroup archive simply based on size compared to binaries.
My impression is that the commercial providers like Giganews stopped
expiring stuff a long time ago so their archives go way back.
--
Regards,
John Levine, ***@taugh.com, Primary Perpetrator of "The Internet for Dummies",
Please consider the environment before reading this e-mail. https://jl.ly
Kyonshi
2024-03-31 23:03:03 UTC
Permalink
Post by SugarBug
I wonder how many peers are carrying and preserving fulltext historical and curent Usenet feeds.
Will anyone give a ballpark number?
Which peers have articles remotest in antiquity?
Even guesstimates might be useful.
The question is how much of that really is useful, considering the flood
of spam the last few decades.
candycanearter07
2024-04-01 15:16:36 UTC
Permalink
Post by Kyonshi
Post by SugarBug
I wonder how many peers are carrying and preserving fulltext historical and curent Usenet feeds.
Will anyone give a ballpark number?
Which peers have articles remotest in antiquity?
Even guesstimates might be useful.
The question is how much of that really is useful, considering the flood
of spam the last few decades.
There was still plenty of legitimate conversation, and
filtering/skipping the spam should be easier in retrospect. You could
also apply the existing spam reports for a portion of it.
--
user <candycane> is generated from /dev/urandom
Billy G. (go-while)
2024-05-12 11:41:45 UTC
Permalink
Post by SugarBug
I wonder how many peers are carrying and preserving fulltext historical and curent Usenet feeds.
Will anyone give a ballpark number?
Which peers have articles remotest in antiquity?
Even guesstimates might be useful.
i've imported and deduped all available usenet backups/mbox files from
archive.org (several TB compressed) without any filtering and accepted
every group that showed up which results in 471k groups so far at the
Full-Node.

'news.software.nntp' for example dates back to 1987.

it should be readable via NNTP

Part-Node (111k groups)
file: http://104.244.74.85/usenet/active/part.active.txt
host: 104.244.74.85:11119
user: freefree
pass: freefree


Full-Node (471k groups)
file: http://104.244.74.85/usenet/active/full.active.txt
host: 104.244.74.85:11120
user: freefree
pass: freefree
Tor User
2024-09-27 17:43:08 UTC
Permalink
On Sun, 12 May 2024 13:41:45 +0200
Post by Billy G. (go-while)
Post by SugarBug
I wonder how many peers are carrying and preserving fulltext
historical and curent Usenet feeds.
Will anyone give a ballpark number?
Which peers have articles remotest in antiquity?
Even guesstimates might be useful.
i've imported and deduped all available usenet backups/mbox files
from archive.org (several TB compressed) without any filtering and
accepted every group that showed up which results in 471k groups so
far at the Full-Node.
'news.software.nntp' for example dates back to 1987.
it should be readable via NNTP
Part-Node (111k groups)
file: http://104.244.74.85/usenet/active/part.active.txt
host: 104.244.74.85:11119
user: freefree
pass: freefree
Full-Node (471k groups)
file: http://104.244.74.85/usenet/active/full.active.txt
host: 104.244.74.85:11120
user: freefree
pass: freefree
Unable to contact either server .....

torsocks telnet 104.244.74.85 11119
[proxychains] config file found: /etc/onion.proxychains.conf
[proxychains] preloading /usr/lib/x86_64-linux-gnu/libproxychains.so.4
Trying 104.244.74.85...
telnet: Unable to connect to remote host: Connection refused

torsocks wget http://104.244.74.85/usenet/active/full.active.txt
[proxychains] config file found: /etc/onion.proxychains.conf
[proxychains] preloading /usr/lib/x86_64-linux-gnu/libproxychains.so.4
--2024-09-27
12:40:55-- http://104.244.74.85/usenet/active/full.active.txt Connecting to 104.244.74.85:80... failed: Connection refused.
D
2024-09-27 18:36:02 UTC
Permalink
Post by Tor User
On Sun, 12 May 2024 13:41:45 +0200
Post by Billy G. (go-while)
Post by SugarBug
I wonder how many peers are carrying and preserving fulltext
historical and curent Usenet feeds.
Will anyone give a ballpark number?
Which peers have articles remotest in antiquity?
Even guesstimates might be useful.
i've imported and deduped all available usenet backups/mbox files
from archive.org (several TB compressed) without any filtering and
accepted every group that showed up which results in 471k groups so
far at the Full-Node.
'news.software.nntp' for example dates back to 1987.
it should be readable via NNTP
Part-Node (111k groups)
file: http://104.244.74.85/usenet/active/part.active.txt
host: 104.244.74.85:11119
user: freefree
pass: freefree
Full-Node (471k groups)
file: http://104.244.74.85/usenet/active/full.active.txt
host: 104.244.74.85:11120
user: freefree
pass: freefree
Unable to contact either server .....
torsocks telnet 104.244.74.85 11119
[proxychains] config file found: /etc/onion.proxychains.conf
[proxychains] preloading /usr/lib/x86_64-linux-gnu/libproxychains.so.4
Trying 104.244.74.85...
telnet: Unable to connect to remote host: Connection refused
torsocks wget http://104.244.74.85/usenet/active/full.active.txt
[proxychains] config file found: /etc/onion.proxychains.conf
[proxychains] preloading /usr/lib/x86_64-linux-gnu/libproxychains.so.4
--2024-09-27
12:40:55-- http://104.244.74.85/usenet/active/full.active.txt Connecting to 104.244.74.85:80... failed: Connection refused.
they were working for a few days (maybe weeks?) after this . . .
Post by Tor User
Date: Thu, 6 Jun 2024 01:39:33 +0200
Subject: Re: github.com/go-while/nntp-overview
Newsgroups: news.software.nntp
Organization: github.com/go-while
Post by Billy G. (go-while)
Hello World!
just want to create a topic for my repo. maybe anyone finds it useful.
https://github.com/go-while/nntp-overview
it is running great so far and connected to a usenet server i'm writing.
Author
go-while
It's running! got 1.5 billion articles in this overview code
mariadb with rocksDB engine is powering the hashdb now
I'v put all sha256 hashs of message-ids with size in rocksdb
4k tables 000-fff, cutting first 3 chars of hash
only 75gigs of hashs in this database :D
maybe a little more. zfs compression earned 13%
Billy G. (go-while)
[end quoted excerpt]

but the results had very distinctive gaps, as in incomplete;
akaik, blueworld is the most complete free nntp server with
about 21+ years retention with many older archived articles
in quite a few newsgroups that i have tested over the years;

see also utzoo-wiseman archive http://annex.retroarchive.org/utzoo,
https://archive.org/details/usenethistorical with many hierarchies;
Loading...