
/sci/ - Science & Math



File: 9 KB, 312x375, circles.png
No.4340373

Help me, /sci/, you're my only hope.

I have a set S of 1.7 million points on a 2D plane. I want to reduce that to a more manageable set, S', which is a subset of S such that, for a chosen fixed radius r, every point in the plane [i.e. any point, not just those in S] that was within radius r of a point in S remains within radius r of a point in S'.

The specific case is to do with postcodes in the UK; I want a minimal* list of postcodes so that every point in the UK is within a given distance of one.

*: I don't really care about it being provably minimal, but smaller is far better than bigger.

Any suggestions on how I should approach this?

>> No.4340388

Provably minimal would be impossible anyway unless given some futuristic magic computing machine; this is essentially geometric set cover, which is NP-hard.

Let me think about this for a few minutes and I'll get back to you.

>> No.4340424

>>4340388
That'll be really appreciated, cheers.

>> No.4340426

Okay, so I'd first create an array of length 1.7M that contains, for each node, the list of every node within 2r of it. In 2D, I'm pretty sure this takes less than O(n^2) time and is actually doable. It also fits within your RAM, assuming you don't have too many postcodes too close to each other.
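One way that neighbor array can be built in well under O(n^2) is to bucket points into a uniform grid with cell size equal to the search radius, so each point is only compared against the 3x3 block of cells around it. A sketch, assuming points come as a list of (x, y) tuples (pass radius=2r to get the 2r neighborhoods described above):

```python
from collections import defaultdict

def build_neighbors(points, radius):
    """For each point, list the indices of all other points within `radius`.

    Buckets points into a uniform grid with cell size `radius`, so each
    point only needs to be checked against the 3x3 block of cells around
    it -- roughly O(n * k) where k is the local density, not O(n^2).
    """
    grid = defaultdict(list)
    for i, (x, y) in enumerate(points):
        grid[(int(x // radius), int(y // radius))].append(i)

    r2 = radius * radius
    neighbors = [[] for _ in points]
    for i, (x, y) in enumerate(points):
        cx, cy = int(x // radius), int(y // radius)
        for dx in (-1, 0, 1):
            for dy in (-1, 0, 1):
                for j in grid.get((cx + dx, cy + dy), []):
                    if j != i:
                        px, py = points[j]
                        if (px - x) ** 2 + (py - y) ** 2 <= r2:
                            neighbors[i].append(j)
    return neighbors
```

Distant cells are never touched, which is what keeps it fast when the 1.7M points are spread over a whole country.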

I'd create an empty list L which will store the nodes I want in the end.
Starting from there, I'd:
1) run through all the centers in my array, adding to L the nodes I go through, unless I've already added one of their neighbors to L: O(nd) (where d is the degree of the neighborhood graph),
2) run through all the centers, computing the maximum new area that a new center can cover if added to L: O(nd) or O(nd^2) or something like that, linear in n and maybe more than linear in d; let's say O(nf(d)),
3) run through all the centers, adding to L those that add almost the maximum new area (tune this threshold by trial and error; the higher the better, as long as the algorithm completes in a reasonable time), under the condition that no neighbor has been added yet: O(nd) if you stored the new area per center, O(nf(d)) otherwise,
4) go back to step 2 until the maximum new area is 0.
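The steps above can be sketched as a plain greedy set cover, simplified so that the targets are the points of S themselves rather than the continuous area (the stricter plane-coverage requirement would need the new-area bookkeeping of steps 2-4, or a fine sample grid as the target set). The O(n^2) pairwise pass here would also want to be replaced by a bucketed neighbor search at 1.7M points:

```python
def greedy_cover(points, r):
    """Greedy set cover: pick centers from `points` until every point of
    the input lies within r of a chosen center.  Classic greedy gives an
    O(log n)-approximation to the minimum cover size."""
    r2 = r * r
    n = len(points)
    # covers[i] = set of point indices within r of candidate center i
    covers = [{j for j, (px, py) in enumerate(points)
               if (px - x) ** 2 + (py - y) ** 2 <= r2}
              for x, y in points]
    uncovered = set(range(n))
    chosen = []
    while uncovered:
        # pick the center whose disk covers the most uncovered points
        best = max(range(n), key=lambda i: len(covers[i] & uncovered))
        chosen.append(best)
        uncovered -= covers[best]
    return chosen
```

Steps 1 and 3 above are essentially a batched version of this pick-the-best loop, trading a little optimality for fewer passes over the data.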

>> No.4340436

>>4340426
I think this is a realistic approach if f(d) isn't too big for your value of d. The only hard part is coding the procedure that figures out the new area covered by a new disk, but this can be estimated instead of being done in an exact manner. You can use squares instead of circles, for instance: squares circumscribing the circle for the first passes of the algo, then squares inscribed in the circle for the last passes, so that there is indeed no point left that should have been covered and isn't covered because of the approximation.
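That square-based estimate could look like this on a coarse occupancy grid (a sketch; representing coverage as a dict from grid-cell index to boolean, and the cell size, are my assumptions):

```python
import math

def new_area_estimate(covered, cell, cx, cy, r, conservative=False):
    """Estimate the fresh area a disk at (cx, cy) of radius r would add.

    `covered` maps (ix, iy) grid-cell indices to booleans; `cell` is the
    grid spacing.  With conservative=False the disk is approximated by
    its circumscribed square (side 2r), an over-estimate good for early
    passes; with conservative=True by its inscribed square (side
    r*sqrt(2)), an under-estimate safe for the final passes.
    """
    half = r / math.sqrt(2) if conservative else r
    ix0, ix1 = int((cx - half) // cell), int((cx + half) // cell)
    iy0, iy1 = int((cy - half) // cell), int((cy + half) // cell)
    fresh = 0
    for ix in range(ix0, ix1 + 1):
        for iy in range(iy0, iy1 + 1):
            # count cells whose center is inside the square and not yet covered
            mx, my = (ix + 0.5) * cell, (iy + 0.5) * cell
            if (abs(mx - cx) <= half and abs(my - cy) <= half
                    and not covered.get((ix, iy), False)):
                fresh += 1
    return fresh * cell * cell
```

Marking the same cells as covered once a center is accepted keeps steps 2-4 consistent with the estimate.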

>> No.4340443

Also if you don't mind, I think I'll use that problem in tests. It's kinda cool.

>> No.4340510

> use that problem in tests.
Go for it. If people post code for other people on the internet, even better :)

The way I was thinking of attacking it was to create a grid of some kind (rectilinear/triangular overlaps) and then try to cover the gaps in between by adding new points; I'm still undecided on how to tackle it, but I'll certainly consider your approach.

I'm expecting a LOT of neighbours. There are 1.7 million postcode centroids and the UK has a surface area of 94,000 square miles; so for a 5 km radius that's around a thousand or so postcodes within range on average, and some regions (London) are going to have a lot more.

And yet in some regions (e.g. Scotland) they're probably of the order of miles apart from each other...
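As a quick sanity check on those figures, assuming uniform density (and taking 1 sq mi as about 2.59 km^2):

```python
import math

points = 1_700_000
area_sq_mi = 94_000
r_km = 5.0
km2_per_sq_mi = 2.59

density = points / (area_sq_mi * km2_per_sq_mi)  # ~7 postcodes per km^2
disk = math.pi * r_km ** 2                       # ~78.5 km^2 per 5 km disk
avg_neighbours = density * disk                  # ~550 postcodes in range
```

So the uniform-density average is closer to 550 than 1000, with London far above it and rural Scotland far below.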

>> No.4340527

>>4340510
Yeah. I didn't think of it that way. If the array of neighbors still fits in the RAM, I think the idea can still work, even if it might be less efficient than I thought. Otherwise I don't really see how it can be cleverly adapted, besides running the algorithm area by area with a limited number of points, and then, once dense areas have been pruned, running it again on the result (losing a bit more optimality on the borders of the areas, but whatever...).

>> No.4340571

Oooh.
Sort list by distance from a single point.
Start at that point, our distance so far d=0
Find all postcodes at distances between d and d+2r.
Sort those by the angle between the north pole, the origin and the position of the postcode.
Select points until the annulus d..d+2r is completely covered.
Increase d and iterate.
Any holes in that plan?
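The sweep order in that plan can be sketched like this (assuming points as (x, y) tuples and bearing measured clockwise from north; the "annulus is complete" test still needs a proper coverage check, which is the hard part):

```python
import math

def annulus_sweep(points, origin, r):
    """Partition points into annuli of width 2r around `origin` and,
    within each annulus, sort by bearing from north.  This only fixes
    the order in which candidates are considered; selecting enough of
    them to cover each ring is left to the caller."""
    ox, oy = origin
    rings = {}
    for p in points:
        d = math.hypot(p[0] - ox, p[1] - oy)
        rings.setdefault(int(d // (2 * r)), []).append(p)
    for ring in rings.values():
        # atan2(dx, dy) is the clockwise angle from north, wrapped to [0, 2pi)
        ring.sort(key=lambda p: math.atan2(p[0] - ox, p[1] - oy) % (2 * math.pi))
    return rings
```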

>> No.4340609

>>4340571
If I understood you well: you pick the centers in an annulus and want some disks centered in it that cover the whole annulus. You might not be able to, since part of the coverage might come from the outer or inner annulus. However, this can probably be fixed, and it is a way of reducing the problem to smaller subproblems (filling the annuli). I'm not sure whether it's actually better than dividing into regions with square shapes, for instance; I think that using squares reduces the amount of border (for subproblems of roughly the same size), which reduces the border effects. Though, if the subproblems are much simpler to treat because of the easy way in which you can sort the centers by angle, it would probably be a good pruning step before running a more complex and closer-to-optimal algorithm.