06-06991 Research Skills

Heuristic Scheduling Based on Policy Learning

Project Supervisor: Dr. Thorsten Schnier
By: Vineet Khare (MSc. Natural Computation)

School of Computer Science
University of Birmingham
5 December, 2001

Overview

• Introduction
  1. Problem
  2. Some Heuristic Rules
  3. Why Learning?
• Some Techniques (with Merits & Demerits)
  1. Without Knowledge Modification
  2. With Knowledge Modification
• Conclusion

Introduction- 1
Problem

• Scheduling problem
• Solution in One Step- Global Optimal Solution
• Solution using Local Heuristics- Good (enough) Solution
  “Which batch of components should I process next on this machine?”
• Various AI techniques
  1. Expert Systems
  2. Other Methods- Random, Blind, Heuristic.

Introduction- 2
Some Heuristic Rules (Dispatching Rules / Scheduling Rules)

These are simple priority rules based on the information available about the jobs.

Information Available    Rule
Processing time          Shortest processing time (SPT)
Due dates                Earliest due date (EDD)
Slack                    Minimum slack (MINISLK)
Arrival time             First-in first-out (FIFO)
etc.
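As a rough illustration of how such priority rules pick the next job, the sketch below ranks a queue under each of the four rules. The `Job` fields, the `pick_next` helper, and the sample queue are hypothetical; slack is assumed here to mean due date minus current time minus processing time.

```python
from dataclasses import dataclass


@dataclass
class Job:
    name: str
    processing_time: float
    due_date: float
    arrival_time: float


def slack(job, now):
    # Assumed definition: time left until the due date minus the work still to do.
    return job.due_date - now - job.processing_time


# Each rule maps a job to a priority value; the smallest value is dispatched first.
RULES = {
    "SPT": lambda job, now: job.processing_time,
    "EDD": lambda job, now: job.due_date,
    "MINISLK": slack,
    "FIFO": lambda job, now: job.arrival_time,
}


def pick_next(queue, rule, now=0.0):
    """Return the job the given dispatching rule would process next."""
    return min(queue, key=lambda job: RULES[rule](job, now))


# Hypothetical queue waiting at one machine.
queue = [
    Job("A", processing_time=5, due_date=20, arrival_time=0),
    Job("B", processing_time=2, due_date=30, arrival_time=1),
    Job("C", processing_time=8, due_date=12, arrival_time=2),
]

for rule in RULES:
    print(rule, "->", pick_next(queue, rule).name)
```

On this queue the rules disagree (SPT picks B, EDD and MINISLK pick C, FIFO picks A), which is the situation in which choosing the rule itself becomes a decision worth learning.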

Introduction- 3
Why Learning?

• The system should be able to adapt to a changing environment.
• Scheduling systems must include learning components to correct their misconceptions and improve their performance based on experience.
• Difficulty of obtaining experts.
• Most environments are so dynamic that knowledge becomes obsolete too fast.

Some Techniques- 1

• In General 4 Steps:
  1. Obtaining knowledge from sources (human expert, simulation data).
  2. Storing this knowledge in digital form (e.g. decision trees).
  3. Making decisions.
  4. Modifying the stored knowledge.
• Some Evaluation Criteria
  1. Robustness to unforeseen events.
  2. Performance on benchmark problems.
  3. Degree to which they are understood by humans.

Some Techniques- 2
By: H. Fisher, G.L. Thompson (1961)

1. Probabilistic Learning Combinations of Local JSS Rules
• Which rule to apply (SIO or LRT) at a particular decision point is selected using a learning process.
• Computational experiments are used to learn how often each of these rules should be used.

+ Easy to implement; better than either of the 2 rules used alone.
- Doesn’t give very good results.
- Not understandable by humans.
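The idea of learning how often to use each rule can be sketched as follows. This is not Fisher and Thompson's actual update scheme: the job-shop instance, the step size, and the "shift the probability toward the rule used most in an improved schedule" update are all assumptions made for illustration. SIO is read as "shortest imminent operation" and LRT as "longest remaining time".

```python
import random

# Hypothetical 3-job, 3-machine instance: each job is a list of (machine, duration).
JOBS = [
    [(0, 3), (1, 2), (2, 2)],
    [(0, 2), (2, 1), (1, 4)],
    [(1, 4), (2, 3), (0, 1)],
]


def sio(job_id, next_op):
    # Shortest imminent operation: prefer the job whose next operation is shortest.
    return JOBS[job_id][next_op[job_id]][1]


def lrt(job_id, next_op):
    # Longest remaining time: prefer the job with most work left (negated for min()).
    return -sum(d for _, d in JOBS[job_id][next_op[job_id]:])


RULES = {"SIO": sio, "LRT": lrt}


def build_schedule(p_sio, rng):
    """Dispatch one operation per decision point; return (makespan, rule usage)."""
    next_op = [0] * len(JOBS)
    job_free = [0] * len(JOBS)
    machine_free = {}
    used = {"SIO": 0, "LRT": 0}
    while True:
        ready = [j for j in range(len(JOBS)) if next_op[j] < len(JOBS[j])]
        if not ready:
            break
        # The rule itself is chosen at random with the current learned probability.
        name = "SIO" if rng.random() < p_sio else "LRT"
        used[name] += 1
        j = min(ready, key=lambda job_id: RULES[name](job_id, next_op))
        machine, duration = JOBS[j][next_op[j]]
        start = max(job_free[j], machine_free.get(machine, 0))
        job_free[j] = machine_free[machine] = start + duration
        next_op[j] += 1
    return max(job_free), used


def learn(trials=200, step=0.05, seed=0):
    """Assumed update: when a schedule improves on the best makespan so far,
    shift the selection probability toward the rule it used more often."""
    rng = random.Random(seed)
    p_sio, best = 0.5, float("inf")
    for _ in range(trials):
        makespan, used = build_schedule(p_sio, rng)
        if makespan < best:
            best = makespan
            if used["SIO"] > used["LRT"]:
                p_sio = min(0.95, p_sio + step)
            else:
                p_sio = max(0.05, p_sio - step)
    return p_sio, best


p_sio, best = learn()
print("P(SIO) =", p_sio, "best makespan =", best)
```

The learned quantity is a single probability, which matches the demerit noted above: it improves on either rule alone but says nothing a human could read as scheduling knowledge.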

Some Techniques- 3
By: Quinlan (1986)

2. Iterative Dichotomiser 3 (ID3)
• Uses examples to induce production rules (e.g. IF ... THEN ...), which form a simple decision tree.
• Classification: Nodes in a decision tree correspond to attributes of the objects to be classified, and the arcs are alternative values for these attributes.
• The end nodes of the tree (leaves) indicate classes to which groups of objects belong.
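A compact sketch of ID3 on a scheduling-flavoured example: at each node the attribute with the highest information gain is chosen, arcs carry attribute values, and leaves carry the class (here, a dispatching rule). The attribute names and the four training examples are hypothetical, and the sketch omits refinements such as handling unseen attribute values.

```python
import math
from collections import Counter


def entropy(labels):
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())


def information_gain(rows, attr, target):
    # Entropy of the class labels minus the weighted entropy after splitting on attr.
    base = entropy([r[target] for r in rows])
    remainder = 0.0
    for value in {r[attr] for r in rows}:
        subset = [r[target] for r in rows if r[attr] == value]
        remainder += len(subset) / len(rows) * entropy(subset)
    return base - remainder


def id3(rows, attrs, target):
    labels = [r[target] for r in rows]
    if len(set(labels)) == 1 or not attrs:
        return Counter(labels).most_common(1)[0][0]  # leaf: a class label
    best = max(attrs, key=lambda a: information_gain(rows, a, target))
    rest = [a for a in attrs if a != best]
    # Internal node: {attribute: {value: subtree}}
    return {best: {v: id3([r for r in rows if r[best] == v], rest, target)
                   for v in {r[best] for r in rows}}}


def classify(tree, row):
    while isinstance(tree, dict):
        attr = next(iter(tree))
        tree = tree[attr][row[attr]]
    return tree


# Hypothetical training examples: system state -> rule that performed best.
examples = [
    {"due_dates": "tight", "load": "high", "rule": "EDD"},
    {"due_dates": "tight", "load": "low", "rule": "EDD"},
    {"due_dates": "loose", "load": "high", "rule": "SPT"},
    {"due_dates": "loose", "load": "low", "rule": "FIFO"},
]

tree = id3(examples, ["due_dates", "load"], "rule")
print(tree)
print(classify(tree, {"due_dates": "loose", "load": "high"}))
```

The resulting tree reads directly as production rules, for instance IF due_dates = tight THEN EDD.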

Some Techniques- 4
By: S. Piramuthu et al. (1994)

3. Pattern Directed Scheduling
• Decision support system that adaptively applies the most appropriate heuristic given the current state of the system.
• Learning by experimentation- Simulation and Inductive Learning modules.
• Constructive feedback between these modules provides knowledge refinement.

Some Techniques- 5
By: S. Piramuthu et al. (1994)

3. Pattern Directed Scheduling (contd.)

+ Knowledge modification.
+ Computational time required is small enough for use as a real-time aid.

Some Techniques- 6
By: C. Chiu, Y. Yih (1994)

4. Learning Based Methodology for Dynamic Scheduling
• Includes three modules:
  1. Discrete-event simulation.
  2. Instance generation: searching for good training examples using a genetic algorithm.
  3. Incremental induction: a tolerance-based incremental learning algorithm that allows continuous learning and facilitates knowledge modification.

+ Modifies the knowledge contained in the rule bases.
+ Explores routing flexibility and handles uncertainties.

Some Techniques- 7
By: W. Zhang (1996)

5. Reinforcement Learning Approach
• A repair-based scheduler starts with a critical-path schedule and repairs constraint violations.
• The temporal difference algorithm TD(λ) is applied to train a neural network to learn a heuristic evaluation function over states.
• Reinforcement learning methods learn a policy for choosing repair actions.

+ Can be used to repair a schedule when some unforeseen difficulty arises.
- Formulating the scheduling task as a reinforcement learning problem is difficult.
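The TD(λ) update itself can be sketched on a toy repair episode. To keep the example short, a linear evaluation function is swapped in for the neural network mentioned above, and the state features, the reward of -1 per repair step, and the step sizes are assumptions rather than values from the original work.

```python
def value(weights, x):
    # Linear evaluation function: V(s) = w . x(s)
    return sum(w * xi for w, xi in zip(weights, x))


def td_lambda_episode(weights, states, rewards, alpha=0.05, lam=0.7, gamma=1.0):
    """One episode of TD(lambda) with accumulating eligibility traces."""
    trace = [0.0] * len(weights)
    for t, reward in enumerate(rewards):
        x = states[t]
        terminal = (t == len(rewards) - 1)
        # The state reached after the last repair is terminal, with value 0.
        v_next = 0.0 if terminal else value(weights, states[t + 1])
        delta = reward + gamma * v_next - value(weights, x)  # TD error
        trace = [gamma * lam * e + xi for e, xi in zip(trace, x)]
        weights = [w + alpha * delta * e for w, e in zip(weights, trace)]
    return weights


# Toy episode: a schedule starts with 4 constraint violations and each repair
# removes one. Features are [bias, violations / 4]; the reward is -1 per repair.
states = [[1.0, k / 4] for k in (4, 3, 2, 1)]
rewards = [-1.0] * 4


def train(episodes=2000):
    weights = [0.0, 0.0]
    for _ in range(episodes):
        weights = td_lambda_episode(weights, states, rewards)
    return weights


weights = train()
print(weights)
print([round(value(weights, s), 2) for s in states])
```

Under these assumptions the learned value of a state approaches minus the number of repairs still needed, so a scheduler that greedily prefers higher-valued successor states is steered toward schedules that need fewer repairs.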

Some Techniques- 8
By: Y. Yih et al. (1998)

6. Integration of Inductive Learning & NN
• Aimed at developing a multi-objective FMS scheduler.
• Different classes correspond to different objectives.
• A competitive NN associates each simulation output with a class.
• Inductive learning is then used to obtain scheduling rules from this classified data.
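The classification step can be illustrated with plain winner-take-all competitive learning: each unit holds a prototype vector, the unit nearest to an input wins, and only the winner moves toward that input. The two-dimensional "simulation outputs", the initial prototypes, and the learning rate below are hypothetical, and the network in the original work may differ in its details.

```python
def nearest(prototypes, x):
    """Index of the unit whose prototype is closest to x (the winner)."""
    return min(
        range(len(prototypes)),
        key=lambda i: sum((p - xi) ** 2 for p, xi in zip(prototypes[i], x)),
    )


def train_competitive(samples, prototypes, rate=0.2, epochs=20):
    prototypes = [list(p) for p in prototypes]
    for _ in range(epochs):
        for x in samples:
            w = nearest(prototypes, x)
            # Only the winning unit is updated, moving toward the input.
            prototypes[w] = [p + rate * (xi - p) for p, xi in zip(prototypes[w], x)]
    return prototypes


# Hypothetical normalised simulation outputs: (mean tardiness, mean flow time).
samples = [
    (0.10, 0.90), (0.20, 0.80), (0.15, 0.85),  # runs that did well on tardiness
    (0.90, 0.10), (0.80, 0.20), (0.85, 0.15),  # runs that did well on flow time
]

prototypes = train_competitive(samples, [(0.4, 0.6), (0.6, 0.4)])
classes = [nearest(prototypes, x) for x in samples]
print(classes)
```

The class labels produced this way could then serve as the target attribute for an inductive learner such as a decision-tree algorithm.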

Some Techniques- 9
By: Y. Yih et al. (1998)

6. Integration of Inductive Learning & NN (contd.)

+ Real-time scheduling.
+ Knowledge modification.
+ Multiple objectives can be handled.
- Knowledge acquired cannot be explicitly represented.

Conclusion

Knowledge modification is important when the scheduling environment is dynamic.
Different techniques are well suited to different scheduling scenarios.
• e.g., if there are multiple objectives, integration of IL and NN works well.
• For handling unforeseen events, reinforcement learning should be used.

Thank You !!!!
