Cybernetic Machines. Cyberspace
Because a main aim of many cyberneticians is to understand biological learning, various demonstrations have involved ‘‘learning machines’’ realized either as computer programs or as special-purpose hardware. The various schemes for artificial neural nets are examples, and an earlier one was the ‘‘Homeostat’’ of Ross Ashby, which sought a stable equilibrium despite disturbances that could include alteration of its physical structure.
A number of workers, starting with Grey Walter (4), made mobile robots or ‘‘tortoises’’ (land turtles) that showed remarkably lifelike behavior from simple internal control arrangements. They could avoid obstacles and would seek ‘‘food’’ (electric power) at charging stations when ‘‘hungry.’’ The ‘‘Machina speculatrix’’ by Grey Walter did not learn, actually, but later developments implemented learning in various forms.
A task that has been used in a number of studies is polebalancing, where the pole is an inverted pendulum constrained to pivot about a single axis and mounted on a trolley. The task is to control the trolley so that the pole does not fall and the trolley remains within a certain length of track. The input data to the learning controller are indications of the position of the trolley on the track and of the angle of the pendulum, and its output is a signal to drive the trolley. In one study, the controller was made to copy the responses of a human performing the task; in others, it developed its own control policy by trial.
Learning, unless purely imitative, requires feedback of success or failure, referred to as reinforcement. The term ‘‘reinforcement learning,’’ however, has been given special significance as indicating methods that respond not only to an immediate return from actions but also to a potential return associated with the change of state of the environment. A means of estimating an ultimate expected return, or value, for any state has to exist. The most favorable action is chosen to maximize the sum of the immediate return and the change in expected subsequent return. The means of evaluating states is subject, itself to modification by learning.
This extension of the meaning of ‘‘reinforcement learning,’’ having some correspondence to the ‘‘dynamic programming’’ of Richard Bellman, has led to powerful learning algorithms and has been applied successfully to the pole-balancing problem as well as to writing a program that learned to play a very powerful game of backgammon.
Cyberspace. Interactions using the Internet and other channels of ready computer communication are said to occur in, and to define, cyberspace. The new environment and resulting feeling of community are real and amenable to sociological examination. The prefix ‘‘cyber-’’ is applied rather indiscriminately to any entity strongly involving computer communication, so that a cafe offering its customers Internet access is termed a ‘‘cybercafe’’, the provision of bomb-making instructions on the Internet is described as ‘‘cyberterrorism,’’ and so on.
In science fiction, such terms as ‘‘cybermen’’ have been used to refer to humans who are subject to computer control. These uses of the prefix must be deprecated as supporting an erroneous interpretation of cybernetics.
Date added: 2024-06-15; views: 124;