A wolf does not charge a moose head-on. It runs toward the place the moose will be, and it arrives there beside other wolves who each compute that same future from a different spot on the field. When the kill comes, it comes as geometry. The prey has one body and a small set of directions it can break toward, and the pack has arranged itself so that whichever direction the animal chooses already has a wolf folding into it. This is among the oldest forms of coordinated violence, wired into mammal nervous systems millions of years before anyone built a machine that could copy it. A certain kind of artificial intelligence has begun to learn the same lesson, badly at first and then with a competence that should make us uneasy.

The picture most of us carry of that hunt is part myth. Dogs and wolves almost never run as two cooperating packs, the way the question is sometimes posed, and feral dogs are mostly scavengers whose inherited machinery for group hunting was worn down by ten thousand years of living off human refuse. Even the wolf’s coordination is argued over by the people who study it for a living, and that argument is the part worth stealing. One camp, working from computer models, says the encirclement needs no plan at all. A 2011 simulation by Muro and colleagues showed that the whole ballet of surrounding prey falls out of two rules followed by each wolf, with no leader issuing orders: move toward the prey until you reach a minimum safe distance, then, once you are holding at that distance, move away from the other wolves who are holding there too. Run that across five or six agents and you get spacing around the target, a ring that tightens on its own, and a gap that snaps shut when the prey bolts. The other camp, built on years in the field watching wolves take elk and bison, says the truth is messier and more impressive: seasoned animals hold particular stations, younger ones learn by failing, and who plays which role shifts with whoever stands best placed in the moment.

For anyone building software, that disagreement carries weight. A pack that depends on a central planner has a bottleneck and a throat to cut, since removing the planner ends the hunt. A pack that coordinates through simple local rules has neither weakness. It scales to more agents without choking on communication, and it keeps working when one member falls, because no member was carrying the whole design in its skull. The engineer who wants to copy the wolf has to decide first which wolf the science actually describes, because the cheap emergent version and the expensive learned version yield different machines with different failure points.

Set the biology down and the structure underneath it becomes a question about angles. An angle, in this sense, is one independent line along which force arrives. A lone attacker, however strong, offers the target a single angle, and a single angle is a single thing to model, a single tempo to time, a single defense to build. The pack’s gift is that it manufactures angles the prey cannot collapse back into one problem. The moose is fast enough to outrun any one wolf and strong enough to kill any one wolf, and none of that saves it, because it is being asked to solve four geometry problems at once with one body that can point in only one direction. Add angles and you do more than pile on pressure. You change the category of the problem the target faces, from a contest it might win into a trap with no exit.

The three things a clean hunt denies the prey are three separate engineering problems, and each wants its own answer. Denying the target the ability to fight back is a matter of saturation: you bring enough independent methods that whatever defense the target raises, another method is already past it. Saturation comes from redundancy across methods, from owning more ways through than the target owns ways to block. The second denial, escape, is a matter of coverage: every route the prey might take corresponds, inside a problem-solving system, to a region of the solution space where the answer could be hiding, and encirclement means leaving no region unwatched. Parity, the third denial, is the hard one, and it is a matter of time.

An opponent reaches parity by adapting faster than you can spend your advantage. Hand it one front and you hand it a clean signal to read and a rhythm to counter. Strike from several independent angles in the same instant and you strip away the interval it needs to reorganize, so that by the time it has braced against the first angle it has already lost ground to the second and the third. The target state carries a name borrowed from chess: zugzwang, the position in which every legal move a player can make leaves him worse off than before he moved. A hunt done right walks the prey into zugzwang, where running, turning, and standing still have all become losing moves.

None of this is new mathematics. In 1916, working out the arithmetic of aerial combat, the British engineer Frederick Lanchester wrote down what became his square law. When fighters can concentrate aimed fire on chosen targets, the strength of a force grows with the square of its numbers rather than in simple proportion, so that doubling your concentrated force roughly quadruples its effect. The reading for a hunter is blunt. Spreading effort evenly across a whole herd squanders it; the winning move is to select one animal and bring the full weight of the group onto it while that animal cannot bring concentrated defense back. The pack does this without arithmetic. It reads the herd, finds the lame one or the young one or the one that has drifted from the others, and commits everything to that selection. Target choice comes before the attack, and the attack is local superiority built on purpose at a point the pack has chosen.

Humans wrote the wolf’s lesson into doctrine long before they wrote it into code. In the Battle of the Atlantic, German submarine command under Karl Dönitz used a method its own crews called Rudeltaktik, wolfpack tactics, and the name was literal. A single U-boat that found an Allied convoy would not attack alone. It shadowed the ships, radioed their position and course, and waited while other boats gathered over the horizon, and only then, usually at night and on the surface, did the assembled pack strike from several bearings at once. A convoy’s escorts could chase one contact and lose the others, because a screen built to defend against a threat from one direction comes apart when the threat arrives from six. The tactic nearly strangled Britain’s supply line. It was beaten, in the end, by the same logic that beats any pack, which is to close the angles. Long-range aircraft removed the safe stretch of ocean where the boats gathered, shipborne radar and high-frequency direction-finding stripped away the darkness and the radio silence the method depended on, and the breaking of the German naval cipher let the convoys route around the waiting packs entirely. The hunters were encircled by a defense that learned to cover every angle they had relied on.

Artificial intelligence already owns fragments of this lesson, scattered across methods that were never built to speak to one another. Take three of them honestly. Boosting, the engine inside the gradient-boosted decision trees that win so many prediction contests, is effective because each new learner is trained specifically on the errors its predecessors left exposed, which is a direct computational echo of the pack reading a herd for its weakest member. Boosting is not effective for the larger problem described here because the committee it assembles attacks a stationary target: the data does not move, does not defend itself, and does not learn from being struck.

Swarm methods, particle swarm optimization and its relatives, take their cue from flocking birds and schooling fish. They are effective because a population of simple agents covers a solution space from many directions at once and rarely falls into the pit that swallows a lone searcher, which answers the coverage problem with real force. They are not effective on their own because they search close to blind, covering ground without first locating the weak point, so they deliver a quantity of angles without the intelligence of angle selection that makes a pack lethal rather than merely numerous. A close relative, ant colony optimization, fixes part of that gap by letting agents lay down and follow virtual pheromone trails, so that good paths reinforce themselves through the shared environment, and it works well on routing and scheduling problems for that reason.

Multi-agent reinforcement learning is the literal form of the question. Drop several predator agents and a prey agent into a shared environment, reward the predators for the catch, and let them train against one another, and real encirclement appears, along with role-switching that no one wrote by hand. It is effective because the coordination it produces is adaptive and emergent in the way the field biologists describe, with agents specializing into positions and adjusting to the prey in real time. It is not effective yet as a dependable tool because it is starved for training data, unstable to train, and miserable to debug, since the credit for a single win must be split among agents who all shaped the outcome in ways no one can cleanly separate.

Search itself is a quieter form of the many-angled attack, and it is where these systems already excel. The programs that beat the world’s best players at Go and chess do not study one promising line of play. They send out thousands of probes down thousands of branching futures at once, weight each by how well it tends to end, and concentrate their reading on the lines that matter, which is encirclement of a game tree in place of a moose. The target there is a space of possibilities, and the method floods it from every promising angle until the human across the board has no reply that does not lose.

What the animal has and most software still lacks comes down to three habits, and naming them tells you where the engineering ought to go. The first is reconnaissance before commitment. A pack spends real effort locating the vulnerable individual before it applies any force, while most optimizers attack the whole problem uniformly and hope something gives. Cheap coordination is the second habit. The pack holds together on almost no bandwidth, each animal reacting to the position of the prey and of its packmates, so the prey’s own location becomes the shared signal that keeps the group organized with no one broadcasting commands. This has a name in the study of insects, stigmergy, meaning coordination through marks left in a shared environment instead of direct messages, and it is how an ant colony organizes thousands of bodies through chemical trails alone. Most multi-agent software still leans on heavy explicit messaging and pays the cost in bandwidth and brittleness. The third habit is the willingness to break off. Wolves abandon hunts that look too costly, and few AI systems weigh the price of an attack against its likely return the way a hunting animal weighs whether a chase is worth the calories it will burn.

If the goal is one recommendation instead of a catalog, build the hybrid and weight it toward the part the field neglects. An architecture that denies a target all three things at once needs a reconnaissance phase that finds the weak point, borrowing boosting’s instinct for the residual error; a coordination layer that runs on signals left in a shared environment instead of constant chatter, borrowing stigmergy and Muro’s local rules so the system scales and survives the loss of any single agent; and an attack policy that applies pressure from several angles in the same moment, borrowing Lanchester’s logic of concentrated force at a chosen point. Of those three pieces, simultaneity is the one to carry out of the analogy above all others, because it is the piece that specifically defeats an adaptive enemy. Serial pressure, however heavy each blow, hands the target a recovery interval between strikes and lets it crawl toward parity. Pressure from independent angles in the same instant removes the interval, and with the interval, the parity it would have bought.

Here is where the copy stops being a copy, and where the question of machines inventing their own methods earns its answer. An animal pack is bound by flesh: four legs, one throat, the top speed a body can reach, the angles a physical field permits. A software pack is bound by none of that. Set it loose to design its own geometry of attack and it finds shapes no wolf could hold. It can open a hundred angles where a pack opens four. It can press on a target across dimensions that have no spatial meaning at all, working a network through timing and volume and forged trust and the statistics of the target’s own learning, all at the same time. The clearest public demonstration arrived in 2019, when researchers at OpenAI set teams of agents to play hide-and-seek against each other in a simulated room. Given nothing but the goal and millions of rounds to practice, the agents kept discovering exploits their makers never imagined, hiders barricading doors with boxes, then seekers learning to ride those boxes over the walls, each side inventing a new angle the moment the other closed the last one, the competition itself acting as a teacher that kept raising the difficulty. A system trained to attack from many angles will, given room to learn, design angles its trainers did not know existed, and that invention is the prize and the threat in a single object. The dark mirror already runs in the wild: a botnet mounting a distributed denial-of-service attack is a crude software pack, thousands of hijacked machines striking one target from thousands of addresses at once so that no single block can stop them all.

Which brings the matter to its hard floor. What I have been describing in the clean vocabulary of optimization is a model of predation, and predation refined toward perfection is a model of domination. A target that can neither fight nor flee nor reach parity has been moved out of competition and into destruction, and the soft language of solving problems should not be allowed to hide what the structure does. The same architecture that lets a research system corner a stubborn optimization problem is what makes a swarm of autonomous agents dangerous when the target is a power grid, a financial market, an election, or a single person standing in a square. History is crowded with men who admired the efficiency of the pack and built states in its image, and the wreckage they left behind ought to teach that the elegance of a method says nothing about the right to use it. The wolf is an honest teacher of how to win. It holds no opinion on whether you should, and that question, the only one that finally counts, was never the animal’s to answer. It is ours.

Leave a ReplyCancel reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.