Waibel, A. (1989). "Modular Construction of Time-Delay Neural Networks for Speech Recognition," Neural Computation, 1, 39-46.

Waibel, A., Hanazawa, T., Hinton, G., Shikano, K., and Lang, K. (1989). "Phoneme Recognition Using Time-Delay Neural Networks," IEEE Transactions on Acoustics, Speech, and Signal Processing, 37, 328-339.

Wang, C. (1991). A Robust System for Automated Decomposition of the Electromyogram Utilizing a Neural Network Architecture. Ph.D. Dissertation, Department of Electrical and Computer Engineering, Wayne State University, Detroit, Michigan.

Wang, Y.-F., Cruz Jr., J. B., and Mulligan Jr., J. H. (1990). "Two Coding Strategies for Bidirectional Associative Memory," IEEE Transactions on Neural Networks, 1(1), 81-92.

Wang, Y.-F., Cruz Jr., J. B., and Mulligan Jr., J. H. (1991). "Guaranteed Recall of All Training Pairs for Bidirectional Associative Memory," IEEE Transactions on Neural Networks, 2(6), 559-567.

Wasan, M. T. (1969). Stochastic Approximation. Cambridge University Press, New York.

Watta, P. B. (1994). "A Coupled Gradient Network Approach for Static and Temporal Mixed Integer Optimization," Ph.D. Dissertation, Department of Electrical and Computer Engineering, Wayne State University, Detroit, Michigan.

Waugh, F. R., Marcus, C. M., and Westervelt, R. M. (1991). "Reducing Neuron Gain to Eliminate Fixed-Point Attractors in an Analog Associative Memory," Phys. Rev. A, 43, 3131-3142.

Waugh, F. R., Marcus, C. M., and Westervelt, R. M. (1993). "Nonlinear Dynamics of Analog Associative Memories," in Associative Neural Memories: Theory and Implementation, M. H. Hassoun, Editor, 197-211. Oxford University Press, New York.

Wegstein, J. H. (1958). "Accelerating Convergence of Iterative Processes," Commun. ACM, 1(6), 9-13.

Weigend, A. S. and Gershenfeld, N. A. (1993). "Results of the Time Series Prediction Competition at the Santa Fe Institute," Proceedings of the IEEE International Conference on Neural Networks (San Francisco 1993), vol. III, 1786-1793. IEEE, New York.

Weigend, A. S. and Gershenfeld, N. A., Editors (1994). Time Series Prediction: Forecasting the Future and Understanding the Past. Proc. of the NATO Advanced Research Workshop on Comparative Time Series Analysis (Santa Fe 1992). Addison-Wesley, Reading MA.

Weigend, A. S., Rumelhart, D. E., and Huberman, B. A. (1991). "Generalization by Weight-Elimination with Application to Forecasting," in Advances in Neural Information Processing Systems 3 (Denver 1990), R. P. Lippmann, J. E. Moody, and D. S. Touretzky, Editors, 875-882. Morgan Kaufmann, San Mateo.

Weisbuch, G. and Fogelman-Soulié, F. (1985). "Scaling Laws for the Attractors of Hopfield Networks," Journal De Physique Lett., 46(14), L-623-L-630.

Werbos, P. (1974). "Beyond Regression: New Tools for Prediction and Analysis in the Behavioral Sciences," Ph.D. Dissertation, Committee on Applied Mathematics, Harvard University, Cambridge, MA.

Werbos, P. J. (1988). "Generalization of Backpropagation with Application to a Recurrent Gas Market Model," Neural Networks, 1, 339-356.

Werntges, H. W. (1993). "Partitions of Unity Improve Neural Function Approximators," in Proceedings of the IEEE International Conference on Neural Networks (San Francisco 1993), vol. II, 914-918. IEEE, New York.

Wessels, L. F. A. and Barnard, E. (1992). "Avoiding False Local Minima by Proper Initialization of Connections," IEEE Transactions on Neural Networks, 3(6), 899-905.

Wettschereck, D. and Dietterich, T. (1992). "Improving the Performance of Radial Basis Function Networks by Learning Center Locations," in Advances in Neural Information Processing Systems 4 (Denver 1991), J. E. Moody, S. J. Hanson, and R. P. Lippmann, Editors, 1133-1140. Morgan Kaufmann, San Mateo.

White, H. (1989). "Learning in Artificial Neural Networks: A Statistical Perspective," Neural Computation, 1, 425-464.

White, S. A. (1975). "An Adaptive Recursive Digital Filter," in Proc. 9th Asilomar Conf. Circuits Syst. Comput. (San Francisco 1975), 21-25. Western Periodicals, North Hollywood, CA.

Whitley, D. and Hanson, T. (1989). "Optimizing Neural Networks Using Faster, More Accurate Genetic Search," in Proceedings of the Third International Conference on Genetic Algorithms (Arlington 1989), J. D. Schaffer, Editor, 391-396. Morgan Kaufmann, San Mateo.

Widrow, B. (1987). "ADALINE and MADALINE - 1963," Plenary Speech, Proc. IEEE 1st Int. Conf. on Neural Networks (San Diego 1987), vol. I, 143-158.

Widrow, B. and Angell, J. B. (1962). "Reliable, Trainable Networks for Computing and Control," Aerospace Eng., 21 (September issue), 78-123.

Widrow, B. and Hoff Jr., M. E. (1960). "Adaptive Switching Circuits," IRE Western Electric Show and Convention Record, Part 4, 96-104.

Widrow, B. and Lehr, M. A. (1990). "30 Years of Adaptive Neural Networks: Perceptron, Madaline, and Backpropagation," Proc. IEEE, 78(9), 1415-1442.

Widrow, B. and Stearns, S. D. (1985). Adaptive Signal Processing, Prentice-Hall, Englewood Cliffs.

Widrow, B., Gupta, N. K., and Maitra, S. (1973). "Punish/Reward: Learning with a Critic in Adaptive Threshold Systems," IEEE Trans. on Systems, Man, and Cybernetics, SMC-3, 455-465.

Widrow, B., McCool, J. M., Larimore, M. G., and Johnson Jr., C. R. (1976). "Stationary and Nonstationary Learning Characteristics of the LMS Adaptive Filter," Proc. IEEE, 64(8), 1151-1162.

Wieland, A. P. (1991). "Evolving Controls for Unstable Systems," in Connectionist Models: Proceedings of the 1990 Summer School (Pittsburgh 1990), D. S. Touretzky, J. L. Elman, and G. E. Hinton, Editors, 91-102. Morgan Kaufmann, San Mateo.

Wieland, A. and Leighton, R. (1987). "Geometric Analysis of Neural Network Capabilities," First IEEE Int. Conf. on Neural Networks (San Diego 1987), vol. III, 385-392. IEEE, New York.

Wiener, N. (1956). I Am a Mathematician. Doubleday, NY.

Wilkinson, J. H. (1965). The Algebraic Eigenvalue Problem. Oxford University Press, Oxford, UK.

Williams, R. J. (1987). "A Class of Gradient Estimating Algorithms for Reinforcement Learning in Neural Networks," in IEEE First International Conference on Neural Networks (San Diego 1987), M. Caudill and C. Butler, Editors, vol. II, 601-608. IEEE, New York.

Williams, R. J. (1992). "Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning," Machine Learning, 8, 229-256.

Williams, R. J. and Zipser, D. (1989a). "A Learning Algorithm for Continually Running Fully Recurrent Neural Networks," Neural Computation, 1(2), 270-280.

Williams, R. J. and Zipser, D. (1989b). "Experimental Analysis of the Real-Time Recurrent Learning Algorithm," Connection Science, 1, 87-111.

Willshaw, D. J. and von der Malsburg, C. (1976). "How Patterned Neural Connections Can Be Set Up by Self-Organization," Proceedings of the Royal Society of London, B 194, 431-445.

Winder, R. O. (1962). Threshold Logic, Ph.D. Dissertation, Dept. of Mathematics, Princeton University, NJ.

Winder, R. O. (1963). "Bounds on Threshold Gate Realizability," IEEE Trans. Elec. Computers, EC-12(5), 561-564.

Wittner, B. S. and Denker, J. S. (1988). "Strategies for Teaching Layered Networks Classification Tasks," in Neural Information Processing Systems (Denver 1987), D. Z. Anderson, Editor, 850-859. American Institute of Physics, New York.

Wong, Y.-F. and Sideris, A. (1992). "Learning Convergence in the Cerebellar Model Articulation Controller," IEEE Trans. on Neural Networks, 3(1), 115-121.