#`((title . “Reverse Polish Lisp: A first foray into language design”)
(date . ,(string->date* “2016-07-24 15:00”))

(tags “project” “program” “language” “c” “flex” “bison” “yacc” “lexx”)

(summary . “My first toy language.”)

(project . “rplisp”)

Background

During the summer between my senior year of high school and my freshman year of college
	the strangest desire to write my own programming language overcame me. This would be the
	first of many times I'd feel this need. The idea first came to me while trying to figure
	out how to write programs for my HP-50G graphing calculator. 

HP calculators have, for
	many years, been the only calculator to perpetuate a strange method of input. Rather than
	having the user input equations and expressions in the standard "algebraic" way that is
	common to most calculators, the user has the option of entering expressions in Reverse
	Polish Notation. For example, rather than entering: " (code "1 + 2") " to calculate a sum,
	the user might enter: " (code "+ 1 2") " This makes for a much more efficient, and, once you
	practice it for a while, natural feeling method of input.")

Like most sufficiently fancy calculators, they also have a scripting language analagous
	to TI-BASIC on Texas Instruments calculators. Keeping consistency with their method of input,
	HP calculators use a language called RPL or Reverse Polish Lisp. I wanted to be able to
	develop and test RPL programs for my calculator on my computer, so I decided I was going
	to write my own interpreter. I quickly realized that this wouldn't be the easiest thing
	in the world to do because RPL consists entirely of "
	(a (@ (href "https://en.wikipedia.org/wiki/Threaded_code"))
		"threaded") " code with each function on the calculator being represented by a
	unique instruction. Because the code is stored and edited in this format I would have
	to write a custom editor or translator along with my compiler if I wanted it to be of 
	much use to me at all. In hindsight that doesn't sound so bad, but at the time it was a
	secondary goal. If I completed my language I knew I could translate it later.

The Language

This being my first experience with writing a programming language,
	I didn't want to have to futz about too much with writing my own
	parser so I used flex and bison to generate a parser and lexer for me. As it turns out,
	this ended up being almost as much work as writing the parser by hand. I'm sure that
	they make the process of writing parsers much easier once you're experienced using them,
	but it wasn't the best choice at the time. In following projects I was more careful to
	consider the learning curve of the tools I used.

The language is threaded and stack based, much like RPL. There are two stacks. the
	user stack and the program stack. The user stack is where user data is pushed on entry.
	For example, suppose the user enters " (code "2") " the top value on the user stack will
	be the value " (code "2")". The program stack is where program instructions are stored. When a token
	is lexed it is turned into an instruction and pushed to the program stack. The instruction
	at the base of the stack and works its way forward. Both of these stacks are stored in one
	large chunk of memory. The user stack is built up from the bottom and the instruction stack
	is built down from the top.

I don't remember the exact reason that I quit developing the language, but I seem to remember
	thinking that there were some, seemingly at the time, very difficult problems to face if I
	wanted to do more with the language. I'm confident that with what I now know I could improve
	the language more, but at this point I've moved on to other things. I think the next thing needed
	in the language would be user input and output functions. Now I don't think any of these would be too
	difficult, but I may have forgotten the reasons I didn't do them then.

Example Program
	
Here is an example program in rplisp to compute the factorial of 10."
	(pre ,(call-with-input-file "assets/factorial.rpl" read-string)))

Let's break this down. The first thing you need to know is that
	parentheses mean nothing. Chevrons (<<|>>), on the other hand, are
	a bit more important."
	(pre "<< 1 swap fact-work >> fact def")
	"The portion surrounded by chevrons is a snippet of code. Because
	It's in chevrons the whole bit of code is compiled and pushed to
	the stack as one instruction. " (code fact)" is a symbol. Because it doesn't
	yet have a definition it's pushed to the stack. " (code def) " is the define
	function. It associates the symbol fact with the code snippet previously
	pushed to the stack. So, now we have a fact function. In actuality it's
	a wrapper function for " (code fact-work) "."
	
	(pre "<< << dup 2 gt >> << dup 1 - fact-work * >> << * >> ifte >>")
	
	"This is the meat of the factorial function. " (code ifte) " is an
	if then else statement. It pops three code snippets from the stack.
	The last code snippet it pops is then run. If the top of the stack
	evaluates to true then it runs the second code snippet popped. Otherwise
	it will evaluate the first snippet popped. This code will duplicate
	the top value of the stack and compare it to " (code "2") ". If
	greater than two it will recursively call "(code "fact-work")",
	otherwise, it will multiply one last time and leave the result on
	the stack."

	(pre "10 fact print")

	Finally, we call " (code "fact") " on the value " (code "10") " and we print the result. 

					


Give it a try:

(pre
  		"git clone "
	(a (@ (href "https://github.com/rbryan/rplisp"))
		"https://github.com/rbryan/rplisp\n")
 		"cd rplisp\n"
 		"make\n"
	"./rplisp < factorial.rpl")
)))
Written on