Maximizing Life-Time-Resources is an optimization that every human being runs one way or another. Time can be traded for Resources and vice-versa. For now, we only get one (Physical) Life.
Our reality is made up of three layers: Mental, Digital and Physical.
It used to be just the Physical world. It is increasingly a combination of the Physical and the Digital worlds. The Mental world is what a human holds and believes to be the Real world.
Our imagination/Mental-model led us to create an extension to the Physical world. The Digital world offers a further extension through Virtual/Augmented Reality. Our Mental layer ultimately defines what we believe and guides our actions. Our Mental layer is private by default.
As humans, we have one Physical Life, but we have the ability to create multiple Digital Agents that can continue to live long after our Life, Time, and Resources run out.
In Reinforcement Learning (RL), an Agent maximizes the expected sum of discounted future rewards in a Markov Decision Process (MDP), which requires the Agent to interact with its environment to learn, act, and adapt/improve. Many of RL's analogies are drawn from human psychology. It is a simple but powerful formulation that applies to Human life at large.
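As a concrete illustration (not part of the original text), here is a minimal Python sketch of that Agent-environment loop: a tabular Q-learning Agent on a hypothetical three-state chain MDP, learning to maximize the expected sum of discounted future rewards. The environment, states, and hyperparameters are assumptions chosen purely for illustration.

```python
import random

# Hypothetical 3-state chain MDP: states 0..2, actions 0 (stay) and 1 (move right).
# Reaching state 2 yields a reward of 1 and ends the episode.
N_STATES, N_ACTIONS = 3, 2
GAMMA, ALPHA, EPSILON = 0.9, 0.1, 0.1  # discount, learning rate, exploration rate

def step(state, action):
    """Environment dynamics: returns (next_state, reward, done)."""
    next_state = min(state + action, N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done

Q = [[0.0] * N_ACTIONS for _ in range(N_STATES)]

for episode in range(500):
    state, done = 0, False
    while not done:
        # Epsilon-greedy action selection: mostly exploit, occasionally explore.
        if random.random() < EPSILON:
            action = random.randrange(N_ACTIONS)
        else:
            action = max(range(N_ACTIONS), key=lambda a: Q[state][a])
        next_state, reward, done = step(state, action)
        # Q-learning update: move Q(s, a) toward reward + discounted best future value.
        target = reward + (0.0 if done else GAMMA * max(Q[next_state]))
        Q[state][action] += ALPHA * (target - Q[state][action])
        state = next_state

print(Q)  # Learned values should favor action 1 (move right) in states 0 and 1.
```

The Agent starts knowing nothing about the environment; only by interacting with it does it learn which actions lead to long-term reward, which is the loop described above.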
The Goal is to learn, as a Human/Agent, to maximize Life-Time-Resources/Long-Term-Rewards using the Mental-Digital-Physical/Markov-Decision-Process framework.
Reward is enough? TBD
Creating and executing ideas that cause physical changes is key to attaining human-level intelligence. For Artificial Intelligence (AI) beyond human-level, general intelligence, the reward signal should point towards creativity. AI Agents should collectively figure out what the Goal is (if there is one) and what the Reward is (if it is enough).