Two-stage index computation for bandits with switching penalties II : switching delaysReportar como inadecuado




Two-stage index computation for bandits with switching penalties II : switching delays - Descarga este documento en PDF. Documentación en PDF para descargar gratis. Disponible también para leer online.

Editor: Universidad Carlos III de Madrid. Departamento de Estadística

Issued date: 2007-05

Serie-No.: UC3M Working papers. Statistics and Econometrics07-10

Keywords: Dynamic programming , Semi-Markov , Finite state , Bandits , Switching delays , Index policy , Whittle index , Hysteresis , Work-reward analysis , LP-indexability , Analysis of algorithms

Rights: Atribución-NoComercial-SinDerivadas 3.0 España

Abstract:This paper addresses the multi-armed bandit problem with switching penalties including both costs and delays, extending results of thecompanion paper J. Niño-Mora. -Two-Stage Index Computation for Bandits with Switching Penalties I: Switching Costs-. ConditThis paper addresses the multi-armed bandit problem with switching penalties including both costs and delays, extending results of thecompanion paper J. Niño-Mora. -Two-Stage Index Computation for Bandits with Switching Penalties I: Switching Costs-. Conditionally accepted at INFORMS J. Comp., which addressed the no switching delays case. Asawa and Teneketzis (1996) introduced an index for bandits with delays that partly characterizes optimal policies, attaching to each bandit state a -continuation index- (its Gittins index) and a -switching index-, yet gave no algorithm for it. This paper presents an efficient, decoupled computation method, which ina first stage computes the continuation index and then, in a second stage, computes the switching index an order of magnitude faster in at most (5-2)n^3+O(n) arithmetic operations for an n -state bandit. The paper exploits the fact that the Asawa and Teneketzis index is the Whittle, or marginal productivity, index of a classic bandit with switching penalties in its semi- Markov restless reformulation, by deploying work-reward analysis and LP-indexability methods introduced by the author. A computational study demonstratesthe dramatic runtime savings achieved by the new algorithm, the near-optimality of the index policy, and its substantial gains against a benchmark index policy across a wide instance range.+-





Autor: Niño-Mora, José

Fuente: http://e-archivo.uc3m.es


Introducción



1698-482X 2 111A11X UNIVERSIDAD CARLOS III DE MADRID INSTITUTO DE POLÍTICA Y GOBERNANZA (IPOLGOB) DEPARTAMENTO DE CIENCIA POLÍTICA Y SOCIOLOGÍA ÁREA DE CIENCIA POLÍTICA Y DE LA ADINISTRACIÓN Documento de Trabajo n 19-2009 EL CONFLICTO TRIBAL DE IRLANDA DEL NORTE JAVIER RUIZ MARTÍNEZ* Abstract: From the end of the 60s to the end of the 90s, Northern Ireland underwent a tough process of political violence.
The political conflict passed to be placed into the sphere of party competition to the arena of tribal violence and back again.
This paper summarizes the main facts and reasons that provoked the conflict and the way to get out of it.
It finishes with the conclusion of the Belfast Agreement and the devolution of powers to the new Northern Ireland government. Key Words: Northern Ireland, nationalism, terrorism, political violence. Palabras clave: Irlanda del Norte, nacionalismo, terrorismo, violencia política. Javier Ruiz Martínez es profesor del Departamento de Ciencia Política y Sociología de la Universidad Carlos III de Madrid. * 3 CONTENIDO 1.
El conflicto de Irlanda del Norte .
5 2.
El avance de la violencia terrorista en Irlanda del Norte .
13 3.
La búsqueda de una solución coparticipada al conflicto: cambios en los actores estatales.
26 4.
El impulso al proceso de normalización democrática .
38 BIBLIOGRAFÍA .
42 4 El conflicto tribal de Irlanda del Norte 1.
El conflicto de Irlanda del Norte Desde finales de los sesenta, un buen número de Estados de Europa Occidental se vieron fustigados por una ola de violencia paramilitar organizada que puso ...





Documentos relacionados