27 May 2023 |
| [bot] joined the room. | 04:14:11 |
30 May 2023 |
| Aminda.eu ⚧︎ (she/they) joined the room. | 14:20:24 |
Aminda.eu ⚧︎ (she/they) | Redacted or Malformed Event | 14:24:08 |
| Cat joined the room. | 14:31:06 |
madlittlemods (Eric Eastwood) | @aminda:pikaviestin.fi: Since there isn't any way to tell whether a room is public/world_readable without joining it, the archive user will join any room requested. But it will only serve content for public rooms and only allow indexing for history_visibility: "world_readable" rooms.
There is no mechanism for leaving a room. But the bot will try to join a room again any time it's requested. It seems like this covers the case around libera.chat?
| 14:48:08 |
| tulir joined the room. | 15:00:35 |
Aminda.eu ⚧︎ (she/they) | Redacted or Malformed Event | 15:08:44 |
tulir | In reply to @madlittlemods:matrix.org
@aminda:pikaviestin.fi: Since there isn't any way to tell whether a room is public/world_readable without joining it, the archive user will join any room requested. But it will only serve content for public rooms and only allow indexing for history_visibility: "world_readable" rooms.
There is no mechanism for leaving a room. But the bot will try to join a room again any time it's requested. It seems like this covers the case around libera.chat?
could use https://github.com/matrix-org/matrix-spec-proposals/pull/3266 for checking visibility before joining? | 15:10:46 |
madlittlemods (Eric Eastwood) | In reply to @aminda:pikaviestin.fi
Why does the source code also share history for shared or members only visibility? https://github.com/matrix-org/matrix-public-archive/pull/239
It depends on whether the bot will first leave some of the 250 other rooms or get idle kicked from them as libera.chat will not allow the ghost to join any more channels Any public room should be viewable in Matrix Public Archive. The idea is if a random Matrix user can view the room, then it should be viewable in the archive. But only history_visibility: "world_readable" rooms are indexable by search engines. Also keep in mind that the Matrix Public Archive holds onto no data (it's stateless) and requests the data from the homeserver every time. archive.matrix.org does have some caching in place (5 minutes for the current day, and 2 days for past content).
I'm currently clarifying this kind of thing in multiple places right now and will add this info to the FAQ
Ahh, sounds like a detail of libera.chat that we don't handle then. There is no special casing for community rules like that
| 15:16:48 |
| Gnuxie 💜🐝 joined the room. | 16:02:46 |
Gnuxie 💜🐝 | Redacted or Malformed Event | 16:08:24 |
madlittlemods (Eric Eastwood) | Added in https://github.com/matrix-org/matrix-public-archive/pull/241 ✅ | 16:10:43 |
madlittlemods (Eric Eastwood) |
I'm currently clarifying this kind of thing in multiple places right now and will add this info to the FAQ
Added in https://github.com/matrix-org/matrix-public-archive/pull/241
| 16:11:20 |
TravisR | (clearing stuck notification with a message) | 16:13:33 |
Gnuxie 💜🐝 | So the justification is that all history settings for a joinable room equate to world readable. Whether that's true isn't the bad part, the bad part is that those settings suck and only provide comfort not security. | 16:27:55 |
Gnuxie 💜🐝 | * So the justification is that all history settings for a joinable room equate to world readable. Whether that's true isn't the bad part, the bad part is that those settings suck and only provide comfort, not security. | 16:28:33 |
Gnuxie 💜🐝 | And you could say "well it should be obvious (that the different settings for history visibility in a "public" room aren't doing much for you) but it isn't obvious, since rooms use them in belief that it adds some complexity to obtaining history | 16:32:44 |
Gnuxie 💜🐝 | * And you could say "well it should be obvious" (that the different settings for history visibility in a "public" room aren't doing much for you) but it isn't obvious, since rooms use them in belief that it adds some complexity to obtaining history | 16:33:09 |
| @dibi58:matrix.org joined the room. | 16:37:46 |
madlittlemods (Eric Eastwood) | Gnuxie 💜🐝: The archive only shows rooms in these cases:
m.room.history_visibility is world_readable
m.room.history_visibility is shared and m.room.join_rules is public
shared room visibility` means "All events in the room are accessible, even those sent when the member was not a part of the room."
| 16:41:50 |
Aminda.eu ⚧︎ (she/they) | Redacted or Malformed Event | 16:44:39 |
| Jonah [matrix.org] joined the room. | 18:53:17 |
| bkil joined the room. | 22:35:13 |
bkil | Redacted or Malformed Event | 22:36:14 |
bkil | What is the latency of the archive interface? | 22:36:57 |
bkil | Redacted or Malformed Event | 22:40:20 |
bkil | Redacted or Malformed Event | 22:40:30 |
bkil | Redacted or Malformed Event | 22:46:50 |
bkil | Redacted or Malformed Event | 22:53:31 |
madlittlemods (Eric Eastwood) | bkil: The archive pulls data directly from the homeserver for every request (it stores nothing). There is some caching on top though (5-minutes for the current day and 2-days for past content) | 22:57:54 |