Tuesday, December 27, 2011

Classifiers for mobile devices

Imagine, for some reason, someone wanted to have on a mobile device a classifier that could recognize various object classes.  That is, if the system recognized face it would not be able to tell to who it belonged - but it would know that it was a face.  Viola and Jones provide a nice system (Haar-like classifiers) to do such classification for faces.  For anyone who wants to go further, Eigenfaces are a good place to start looking.

What I ask is: is there a way to rig a classifier based on Haar-like features to classify multiple features in parallel?  That is, can we efficiently detect 50+ object classes on an iPod/iPad in real time?  (I like developing on iOS, I know Android is more "open", but the Apple's tools are a pleasure to use!)

Derivatives are the solution.  I forget in which paper I read this, but the rectangular features that are subtracted from each other are like coarse derivatives using many pixel values for normalization purposes.  This is achieved by realizing that the Haar-like features form a basis that is over-represented (it's not orthonormal).

So - in other words the Haar-like features present a reduced/compressed form of the image which is convenient for recognition purposes.

According to my arithmetic, at lower capture resolutions, 10 passes over the image have to be done to scan a 24x24 window over the image (one pass per scale).  So we start with a 24x24 window that is slide across the entire image, then we increase it's size to say 28x28 pixels and slide it across the image, and so forth.  According to Viola and Jones the increment should percentage of the original size (as is the shifting).  I really recommend reading their paper: Rapid Object Detection using a Boosted Cascade of Simple Features.

What I started to ask myself: could a Haar-like feature be shared among multiple classes of objects?  Yes. The features are differences.  The varying thresholds to accept a feature may pose a problem, but that would be solved with how the data is organized.

But I have a bet.  An interesting bet.  If we have just a horizontal and vertical feature.  Could we restrict it to a square?  Yes...  That means that the first feature we pick is going to be a square (either horizontal or vertical split for the subtraction).  It also means that we have divided the space into 2 buckets.

Ok, next round - what is the standard deviation among the features from the training set?  That should determine the next round of features that will be placed spatially around the initial feature.  So we detect a rectangle, but that rectangle may belong to a larger feature which we must then scan for by checking surrounding rectangles.

However, we can not let the number of rectangles to test explode after the initial round.

By using this trick, Adaboost in the form that Viola and Jones describes would not be sufficient.  Rather, I believe rigging the features so they don't overlap may provide the solution that I need.

This is as far as my thinking has gone (except for a means of accelerating the training so it could be done on the iOS device as well - I've solved it and managed to find a way to reduce storage needs, but that is the simple part of the puzzle that I'm looking at).

Sunday, November 27, 2011

Haskell "to add or multiply" (slow version)

Today I'm rewriting this entire post (minus the code - it's very ugly code in retrospect).  So I'm trying to solve the "to add or multiply" problem from the ACM 2011 finals.  My hunch last week was that it could be solved in linear time.  This week I explored what I thought would be a linear solution and came back with some interesting notes.

Before I jump into the subject; the problem gives two ranges of integers (start and end), and two constants 'a' and 'm'.  Find arbitrary positive integers [i1, i2, i3, ... in] such that the following is contained within the range 'end':
(((start+i1*a)*m^i2 + i3*a)*m^i4 + i5*a) ....

(where arithmetic on ranges is identical to operations on a 2-vector.  a range consists of a start-value and end-value)

First, I now believe that there is no linear-time solution.  Intuitively, the reason is because there is a destination range and not a destination number.

Consider the inclusive range [a,b].  Now, there's a number 'x' that I can multiply an integer 'j' to get a value between a and b.  I also want 'j' to be minimal.  This is quite simple - if 'a mod x = 0' then the answer is a/x else it is 1+a/x.  Recall the computer always floors integer data, we would want the ceiling.

Within the problem, there's a point at which both j=a/x and j=1+a/x should be explored.  Below I'll try to informally explain this:

For the problem, it is possible to find a value 'a' that is the maximum number of multiplications that may be applied to [p,q] (the start range) until the values exceed [r,s] or the number of integers within the range of [p,q] exceeds that of [r,s].

The maximum number of multiplications is invariant no matter how many addition operators are present.  Simple example ('x' is a starting value, 'a' and 'm' are integers):
(x+a)m = xm + am

'am' will shift values a constant amount regardless of the starting value 'x'.  This is important, look at a range [2,3] -- [2,3] has 3 digits, [2,3]*4 = [8,12] has 5 digits.  We can add integers before the multiplication and the number of digits will not change.  This allows us to compute another value -- the minimum number of additions needed until a solution.

If it is not possible to find a series of additions that satisfies the start range and end range, then we can not conclude that there is no solution.  Consider that the increments of addition is 100,000 and multiplication is 2 with start range [1,2] to [5,20].

Neither is it possible to rely solely on multiplication.  Consider addition of 1 and multiplication of 7, start range [1,2] and end range [14,21].  The following is contained within the range: ([1,2]+1)*7

Knowing these two values helps us when searching for a solution.  The solution becomes:
x*m^j + i1*a*m^0 + i2*a*m^1 + ...

It's a matter of finding i1, i2, i... (j-1 unknowns).  If multiplication is 1 or 0, or if addition is 0 then the solution can be directly computed in constant time (depending upon your implementation of log - or if you decide to loop over values for multiplication -- which makes it O(n)).

Anyhow, a good strategy is to attempt to maximize the i for the m with the largest exponents.  Think of this as a heuristic.  When the number of integers can be in the tens of thousands, doing a brute-force search will be slow and memory consuming (simple arithmetic, just it will look like a mess here).

I'll continue playing with the numbers, maybe something interesting will pop out.

For historical purposes, here's a very slow / bad implementation of "to add or multiply":

import Data.Monoid
import Data.Char


-- AddOrMultiply
-- Given the ability to add 'a' or multiply 'm', see if there is a sequence
-- that starts in range [p,q] and ends in range [r,s]


data Operation = A Integer | M Integer


instance Show (Operation) where
show (A v) = " " ++ (show v) ++ "A"
show (M v) = " " ++ (show v) ++ "M"


apply :: Operation -> Integer -> Integer
apply (A v) a = (v+a)
apply (M v) a = (v*a)


data Partial = Partial (Integer,Integer,Integer) [Operation] deriving (Show)


applyl :: (Integer,Integer) -> [Operation] -> Partial
applyl (min,max) os = Partial (foldr apply min os,foldr apply max os,0) os


incrementMul :: Operation -> Integer -> Integer
incrementMul (M m) v = 1
incrementMul _ v = v


combineOps :: Operation -> [Operation] -> [Operation]
combineOps o [] = [o]
combineOps (A a) ((A as):xs) = (A $ a+as):xs
combineOps (M m) ((M ms):xs) = (M $ m+ms):xs
combineOps o xs = o:xs


add :: Operation -> Partial -> Partial
add o (Partial (min,max,mul) os) = Partial (apply o min, apply o max, incrementMul o mul) $ combineOps o os


validAdd :: (Integer,Integer)->Partial -> Bool
validAdd _ (Partial (_,_,0) _) = True
validAdd (a,m) (Partial _ ((A a1):_)) = a1 < (a*m)
validAdd _ _ = True


valid :: (Integer,Integer) -> (Integer,Integer) -> Partial -> Bool
valid am (r,s) (Partial (min,max,mul) os) = if (max <= s && max-min <= s-r && (validAdd am $ Partial (min,max,mul) os)) then True else False


iteration :: (Integer,Integer) -> (Integer,Integer) -> Partial -> [Partial]
iteration (a,m) rs ptl = filter (valid (a,m) rs) [add (A a) ptl, add (M m) ptl]


-- concatMap :: (a -> [b]) -> [a] -> [b]


startCondition :: (Integer,Integer) -> [Partial]
startCondition (p,q) = [Partial (p,q,0) []]


success :: (Integer,Integer) -> Partial -> Bool
success (r,s) (Partial (min,max,_) _) = if (min >= r && max <= s) then True else False


iterationl :: (Integer,Integer) -> (Integer,Integer) -> [Partial] -> [Partial]
iterationl (a,m) rs ps = concatMap (iteration (a,m) rs) ps


everything :: (Integer,Integer) -> (Integer,Integer) -> [Partial] -> [Partial]
everything _ _ [] = []
everything am rs ps = ps ++ (everything am rs $ iterationl am rs ps)


solutions :: (Integer,Integer) -> (Integer,Integer) -> (Integer,Integer) -> [Partial]
solutions am pq rs = filter (success rs) $ everything am rs $ startCondition pq


type Ampqrs = (Integer,Integer,Integer,Integer,Integer,Integer)


reformat :: (Integer,Integer) -> [Operation] -> [Operation]
reformat _ [] = []
reformat am ((A a1):(A a2):xs) = reformat am $ (A $ a1+a2):xs
reformat am ((M m1):(M m2):xs) = reformat am $ (M $ m1+m2):xs
reformat (a,m) ((A a1):xs) = ((A $ a1 `div` a):(reformat (a,m) xs))
reformat (a,m) ((M m1):xs) = ((M $ m1 `div` m):(reformat (a,m) xs))


solutionPartial :: Ampqrs -> [Partial]
solutionPartial (a,m,p,q,r,s) = take 1 $ solutions (a,m) (p,q) (r,s)


operationFromPartial :: Partial -> [Operation]
operationFromPartial (Partial _ o) = o


solution' :: (Integer,Integer) -> [Partial] -> Maybe [Operation]
solution' _ [] = Nothing
solution' am xs = Just . reverse . (reformat am) . operationFromPartial . head $ xs


solutionOp :: Ampqrs -> Maybe [Operation]
solutionOp (a,m,p,q,r,s) = solution' (a,m) $ solutionPartial (a,m,p,q,r,s)


solutionOp' :: [String] -> Maybe [Operation]
solutionOp' [a,m,p,q,r,s] = solutionOp (read a, read m, read p, read q, read r, read s)


parseLine :: String -> Maybe [Operation]
parseLine s = solutionOp' $ words s


display :: Integer -> [String] -> String
display _ [] = []
display _ [a] = []
display i (x:xs) = ("Case " ++ (show i) ++ ": " ++ (show $  parseLine x)) ++ ['\n'] ++ (display (i+1) xs)


main = do
contents <- getContents
putStr . (display 1) $ lines contents

Sunday, September 4, 2011

Sonic 4 EP 1 iOS Impressions

The price for Sonic 4 EP 1 has invariably dropped following the trend of most applications on iOS.  Sell for the maximum that the consumer will allow then drop to catch the cheapskates.  Others take the route of giving away a base application (or charging at probably a loss) and recuperating on paid extensions.

First, this game has 17 levels + special stages for each.  Ignoring the boss-fights and the shorter levels, that's about 11 levels.  Let me put the price of this game in perspective ($4.99) for 4 zones and 11 long levels (disclaimer, I didn't research anything about the profit margins of this game).   The first sonic game had about 8 zones, or 24 levels.  Just shy of double what this game offers.  And for much more than double the cost.  My point: people who say it's worth ($0.99) ... wake up and look at the development effort!  The levels appear to be huge + multiple sound-tracks -- and testing such levels is no small feat.  The two previous published iOS games I worked on would need to be more than ($0.99) to recuperate the investment given a target market.  (For those saying it appeared on multiple consoles -- it's not up to the other machines to finance the development for the iOS version).

For those complaining about bugs / crashing.  Email SEGA.  Complaining on the Apple store ratings page might not help.  I had problems with Civ.  Emailing (convoluted process) Firaxis gave me the answer I was seeking.

Most of the game reviews already cover every aspect of the game.  I shouldn't need to go into more details there.

What I'm about to rant about is that it's a cross-platform game.  A game designed for game consoles as well as iOS.  And that's the main fault I find with it.

The developers (to save time) probably (I'm guessing here) had separated the game from everything else.  All platforms support getting input.  It is the semantics of accessing the input device that changes for each platform.  XBox will go through XNA (there might have other options), Wii through (who knows what, it's proprietary), iOS through UIKit.  (Graphics, file access, used meshes, etc. may vary depending upon the platform but I'd guess that the game itself is portable C/C++/(other?) and what changes is glue that makes the whole thing run.)

So, you have a game.  It wants left-press, right-press, up-press, down-press, and action.  That's it.  That's how multiple input schemes become possible.  Don't like pressing the virtual D-Pad on the iOS device?  Then use the tilt sensor and swipe!  A myriad of other options could be presented with little effort (normally inundating the user with frivolous options is a bad idea -- power users may like it but they aren't representative of the whole however vocal they may be).

This works perfectly on the systems with physical controls.  The Wii, XBox, and PS3.  This fails on iOS (not miserably, but it's an issue).  If you don't realize that they are just mapping controls, confusion may ensue.

Why confusion?  Let's take a special stage for example.  They say tilt left and right to turn the world.  That's confusing, but it's part of the game.  Where it fails is -- wait if I'm holding the iOS device parallel to a table?  Should sonic act like a ball in a labyrinth game and slow down?  That might actually be more intuitive.  Further still, why rotate the world when the whole device can be rotated?  The game was programmed for left/right, it gets left/right from tilting left and right.  (I've done some testing with accelerometer controls, my conclusion being that they are the hardest to do right)

Jumping is also a bit jarring.  The button is statically placed.  It's a region.  It might be easier if the region covered the whole right-side of the screen.

Moving is also difficult with the D-Pad.  To conserve screen-space, it's on the left-hand side.  Pressing right can be done by pressing from the end of the go-left button to the middle of the screen (from basic testing).  I couldn't feel a neutral position (this is probably just me complaining for nothing).

Later on there are cannons to shoot sonic to a specific spot.  It may make more sense to tap in the direction the cannon should face, but it's again press left/right to turn and action to shoot.

Game-wise, Sonic should be on the left of the screen when moving right and vice-versa.  This gives the player the ability to anticipate what will happen when running (memorizing the level is fun and all...)

Actually, I had fun playing through the first half of the game.  It's not a bad game, they mapped controls in a very coarse way which takes away from the game a bit.  And I don't complain given the price, the older games cost much more even after release.

What I want to bring to light are the challenges of cross-platform development.  The game was perfectly abstracted away to work on the traditional consoles.  iOS is a different beast.  Certain changes might actually be too drastic and require too much reworking of the levels (the special zones are difficult since Sonic has a continual pull of gravity in a given direction.  Removing that may require rethinking of the levels.  It's more nostalgic this way though!)  You might want to aim the cannon towards one of the on-screen buttons, etc.

Saturday, July 30, 2011

First Day of 10.7

I decided it might be worth updating Mac OS to 10.7.  First, for anyone thinking of updating, double-check all of your apps.  I was surprised by the number of applications that I had that were PowerPC.  I think the machine was spending more time running Rosetta than x86 binaries...  The important apps I found newer versions and trudged forward.  My scanner will only be usable with an older Power Mac.

So.  How does the OS fare?  Here are the positive and negative features on a per-app/feature basis in my opinion.

Mail.app
Mail.app's interface update is great!  The application is now designed for wider screens.  On the far left is the side-bar as it was there before (hidden by default but click on "Afficher" -- I guess that's "Show" in English).  Organized like it is, stretch the application and there is plenty of room for the previews and other information.  The preferences allow for plenty of customization.

iCal
I like the way it looks.  Where Mail.app got a massive functionality upgrade, iCal seems to suffer.  The problem is that things normally aren't neatly split up between months.  So if I'm trying to schedule something it won't be for a specific day but some time-range.  So I want to quickly jump between weeks (if possible see multiple weeks) and not be stuck in groups a day/week/month/year.  (the option to scroll a day at a time is silly - it renders the velocity scrolling of the magic mouse useless)

For example, if in the week view it scrolled continuously and could display multiple weeks in it's columnar view I'd be very happy.  For the day view, for a sufficiently wide screen multiple months can be seen.  Even multiple days.  On a small 1024x768 screen the spacing is elegant, however I'd argue there is space for two more monthly calendars in day and week view.

Finder
Overall, it's as usable as ever.  I can't complain, it feels normal.  A few settings I had to re-enable, but it's as I expected.  The overall view of all my files is... pointless.  It seems to be randomly picking stuff I downloaded -- such as a picture generated by Doxygen.

I'm happy that the library is now hidden.  To many things could go wrong by having that exposed to the user.  Those of us that wish to muck with system stuff can hit Command-Shift-G.  The icons are much clearer.

Terminal
Works.  Happy.  Why can it go fullscreen?!

LaunchPad
I don't know what to make of this.  LaunchPad attempts to present a nicer view of the applications.  Sure, that's great!  The question that's running around my head is: why isn't LaunchPad's view synchronized with the Application folder?

For example, a folder in Applications becomes a folder in LaunchPad.  Of course there are issues with nesting folders but I don't see why folders nested more than one couldn't just be collapsed.  Certain applications would be dangerous to move (bad developers)...  However organizing my applications once is, in my opinion, preferable.

Actually, I would have loved LaunchPad if it just opened a fancy view of my Applications folder.

Apart from that, it's a great idea.  Only if I didn't use the dock to store often-used applications and Spotlight for everything else.  (StarCraft II?  Spotlight!)

Mission Control
A very good update to Exposé.  It unifies all the window management features into one nice spot.  One button to see all my windows from all the apps with all the desktops + fullscreen apps.  Then, windows from the same application are grouped.  It's very nice.

Fullscreen Apps
Really.  I wanted to love this.  I have a small-ish monitor on the side where I tend to throw documentation, iTunes, Mail, and the web browser.  The main monitor is reserved for work (XCode, iOS Simulator, etc.).  Never doubt having the documentation on a second monitor!

Fullscreen apps grey out the second monitor.  As in, they don't use it!  And I can't put floating windows on it if applications on the main monitor are full screen.  XCode, which I thought would actually use both monitors (put the Organizer on the second!) spawned the Organizer as a new full-screen window.

Use multiple monitors?  You'll go further running the applications in windows.

Safari
The download window is gone and replaced with a pop-up...  Thanks Apple!

Unfortunately flash videos skip now when the system is under load...

XCode
The documentation within the Organizer is still a mess.  It's more convenient browsing to Apple's site since the side-bar should be synchronized to the documentation (or a viewable side-bar should be there).

No, I don't want Quick Help.  Quick Help doesn't even pick up on the Doxygen comments littering my code.

Last Impressions
The Mac OS / iOS hybrid seems to be an odd beast.  The ability to quit and resume applications is wonderful.  But the addition of fullscreen applications and launchpad feel like kludges.  (LaunchPad especially feels like it's tacked on rather than integral).

The scroll-bars...  This is minor.  Use it for a day or two with the default settings.  Within a few minutes I was already used to the scrolling.  The elimination of scrollbars didn't affect me:  I never used them anyhow.  Give it a chance for at least a day.

Autocomplete -- it's annoying when it fails.

Those are my comments from one day with the system...

Friday, July 29, 2011

Software: From Simple to Complicated

A few years ago, I started a small software library.  A set of common routines.  Seeing how useful it was, I decided to expand it -- make it more general.  Apply good software development techniques so that it may prove to be more flexible.

For example, at the beginning I hardcoded the ability to use a single 1024x1024 texture for everything.  This meant I had no need to worry about which texture was currently bound (only one) and no need to manage memory (one fixed amount of memory used).  Then I expanded this system to load multiple textures using a plist for the parameters.

For the added flexibility, I was able to create textures and use them as objects.  Yet; as far as making prototypes go it didn't speed things up.  Memory became an issue to manage.  And now mipmapping and other little technological ideas for the sake of technology start to creep in when ideas should be driving the technology.

In the end; I should conclude the organization for generic code is only needed when called for.  If it works, and works well, why change it?

Wednesday, July 27, 2011

StarCraft II: Beating the AI on Hard

Watching others play StarCraft is the best way to learn.  And I have successfully beat the AI on hard using all of the races (1 vs 1).  And the pattern is actually quite simple (on Blizzard's maps at least).

First; we should realize the harvesting is the main bottleneck.  If you have to wait a second to build a unit (unless the game has just started) then something is wrong.

The other bottleneck is production speed.  If you have too many resources and can't spend them on units, something is wrong.

So; here's the overall build order:
1. Get up the 20 units harvesting the mineral fields provided at the beginning.
2. Zerg: get Zerglings (also, build a queen as soon as possible), Terrain: get marines (build two barracks), Protoss: Zealots (build two warp gates).
3. Set up a second base (no air units at this time).  Get 20 units harvesting the second mineral field (while producing a steady stream of offensive units -- your first test is surviving the first wave).
4.  For Zerg start focusing on roaches and move towards hydralisks (air defence) -- if all goes well you have so many minerals to waste that building a third or fourth hatchery for extra unit production is worth-while.  For terrain, build a factory for stronger units, however you should be able to afford 4-5 barracks producing a continual stream of marines.  Similarly, for protoss, 4-5 warp gates can produce plenty of units (but the zealots won't be enough, vary it a bit with them.)
5.  Send waves of units (if there are enough) into enemy territory (scouting may be needed).  I let them die off and focus on preparing the next wave.  If you have the talent, you might want to call back weak units.
6.  Prepare a final wave composed of tougher units (ultralisks, for example).

And victory should follow.  This just rushes the AI with too many units.  It can fail, but it tends to succeed.  If you aren't sure why things went wrong, look at the replays and see how the AI played.

Tuesday, July 26, 2011

Fast Integer Square Root

Here's my attempt at writing a fast square root function for integers.  It's not as fast as the built-in sqrtf function (even with type conversions) but it's decent.  In the worst case it was twice as slow -- but that would downplay the significance of the learning experience.

First, I try to estimate a point near the root.  Consider a number in binary, it's square root will be between two to the power of half of the length of the binary string rounded up and double that value.  The lower bound can be made more accurate, but this is sufficient.

Then, with the maximum and minimum, the midpoint should be a good starting point or guess that I feed into Newton's method.  Newton's method is used to increment or decrement the initial guess to something a bit saner.

More speed can be obtained by replacing some division by bit-shifting (but that's dangerous).  The fastest (and smartest) way would be to use the built-in SSE functions to do the square root and inverse square root (properly pipelined).


//! Same, but this time do Newton's method
int sqNewton(int in_val)
{
int c = in_val;
if (c <= 0) return 0;
int lower = 1;
if (c >= (1 << 16))
{
c >>= 16;
lower <<= 8;
}
if (c >= (1 << 8))
{
c >>= 8;
lower <<= 4;
}
if (c >= (1 << 4))
{
c >>= 4;
lower <<= 2;
}
if (c >= (1 << 2))
{
c >>= 2;
lower <<= 1;
}
//Hit mid-bound
lower = lower*3/2;
//printf("%li %li %li\n", lower, in_val, lower*lower-in_val);
int numerator = lower*lower + in_val;
int denominator = 2*lower;
return numerator/denominator;
}

Monday, July 25, 2011

Rant on P versus NP

The weekend spent on absorbing information relating to P's relation to NP, I shall now solidify my understandings through regurgitation of what I've understood.

First, the types of texts that I've found either went deep into the topic (from the theoretical perspective) or brushed lightly upon it (the way a book on programming treats mathematics).  I'll try to walk a fine line between both while maintaining clarity.  Hopefully it works.

What are P and NP?

These have to do with timing.  Not in seconds, but in 'effort'.  For example, consider sorting books in a book-shelf.  You could scan through the 'N' books to sort, find the smallest one (or whatever sorting criteria you want), and put it at the beginning of the shelf.  This operation could be repeated for the remaining books.  So, to sort 'N' books, you'd look at (on average) 'N/2' books to find the next one to place to the left of the shelf.  That involves looking at 'N^(N/2)' books.  In the big picture, we say 'N^N' (N to the N) operations will be needed.  As the number of books increase, this method of sorting will be excruciatingly slow.  It is left as an exercise to the reader to translate looking at book titles to actual time.

They also have something to do with 'polynomial time'.  That is, a number that can be expressed in the form of 'N^x' where 'N' is the number of elements to work on and 'x' is a constant.  Sorting books according to our previous example takes much longer than polynomial time (think of very large 'N').

P looks at problems that take polynomial time.  Specifically, deterministic problems.  Typically involving Turing machines (but I don't want to go into the details of tapes).  Essentially, any problem whose solution expressed as a series of unambiguous steps takes polynomial time is in P.

And NP also looks at problems that take polynomial time.  This time, problems whose solution is expressed non-deterministically.  Back to our book sorting problem, we looked at 'N/2' books (on average) to find the next book to place on the shelf.  But what if we just 'looked at all the books at once' and happened to pick the right one.  Something similar goes on with 'Oracle machines'.  However, NP is easier to understand as the time it takes to verify the problem.  In other words, our book sorting would take 'N' time in non-deterministic polynomial time (NP-time)

And why is P versus NP important?

The crux of the problem: are all the problems in NP also part of P?  Essentially, NP problems are easy to verify but can be hard to compute.  Like our sorting problem, we only need to inspect each book once to confirm that they are sorted.  Similar ideas are used in cryptography -- a very difficult problem must be solved by the person who attempts to intercept the message.

Problems in P are those that are practically solvable (well, practically this is debatable, theoretically this is true).  There are problems that are much more difficult.  Let's call these NP-Complete (ok, technically NP-Complete problems are a class of problems where if one is easily solved then all of them can be easily solved.  They are very difficult to solve though).  For example, given a series of numbers, find the subset whose sum is the greatest.  Short of attempting every possible combination of numbers, a solution is hard to find (programmers may chime in with 'dynamic programming' rolling off their tongue.  But, that just stores intermediate results, all potential combinations are still evaluated.).

Back to encryption, if P = NP then there is a way to easily decrypt data.  From what I read, a lot of things would become much easier.

But that is the essential nature of P = NP as far as I can tell for now.  I might change my mind as I become more informed.

Friday, July 22, 2011

Haskell to Solve Programming Challenges

There are times when I do silly things.  This is one of them.  Here are my solutions to the first 3 problems for the June 2011 Waterloo Local Contest (http://plg1.cs.uwaterloo.ca/~acm00/) -- not the most difficult, but good enough given my knowledge of Haskell.

So...  Let's start with the game of 31.  Just call the method `finishGame` with the input string.  It runs quite fast.


import List


-- Get the score from a list that is the game
scoreForGame :: (Integral a) => [a] -> a
scoreForGame x = foldl (+) 0 x


-- Change an int into a list
gameFromInt :: (Integral a) => a -> [a]
gameFromInt 0 = []
gameFromInt x = (x `mod` 10) : gameFromInt (x `div` 10)


-- Obtain the player for a game
data Player = A | B deriving (Show)


playerFromGame :: (Integral a) => [a] -> Player
playerFromGame x = if (even $ length x) then B else A




otherPlayer :: Player -> Player
otherPlayer B = A
otherPlayer A = B




-- Score a single move
scoreMove :: (Integral a) => a -> [a] -> a -> a
scoreMove score deck play
   | score + play > 31     = -1
   | otherwise             = if (elem 1 $ map (scoreMove (score+play) deckMinusPlay) (nub deckMinusPlay)) then -1 else 1
   where deckMinusPlay = delete play deck


-- Generate deck
deck :: (Integral a) => [a]
deck = [1..6] ++ [1..6] ++ [1..6] ++ [1..6]


-- Finish the game!
--    Takes the game and writes out the winner!
finishGame :: (Integral a) => a -> Player
finishGame a = if (r > 0) then p else otherPlayer p
   where
      d = deck \\ sort g
      g = gameFromInt a
      s = scoreForGame g
      p = playerFromGame g
      r = if (elem 1 $ map (scoreMove s d) (nub d) ) then -1 else 1


Ok.  That actually wasn't that bad.  And it worked.  Some people might complain that scoreMove returns 1 if the current player wins else -1 -- and keeps flipping the numbers between invocations.  Note that I check to see if the element is 1 rather than the more intuitive maximum as this allows Haskell to do lazy evaluation (on the first 1 found within the map over elements in deck stop processing since the player won!).

The \\ operator was a challenge to find.  It's great!  In `List', it subtracts the elements between lists.  Eg, [1,2,3,4,5] \\ [2,3] = [1,4,5].

On to problem 2.  Factoring large numbers.  Actually, using Haskell is probably cheating as you need to maintain a list of prime numbers to at least finish in 30 seconds (Haskell cached it for me)...  The code is quite simple:


import Control.Parallel


-- Generate an infinite list of primes?!


divides :: (Integral a) => a -> [a] -> Bool
divides _ []   =  False
divides n (x:xs)
   |  x*x <= n    = (n `mod` x == 0) || (divides n xs)
   |  otherwise   = False


firstPrime' :: (Integral a) => [a] -> a  -> a
firstPrime' (p:primes) value
   | (value `mod` p == 0)  = p
   | p*p > value           = value
   | otherwise             = firstPrime' primes value


firstPrime :: (Integral a) => a->a
firstPrime = firstPrime' primes


nextPrimes :: (Integral a) => a -> [a] -> [a]
nextPrimes trial primes
   |  (divides trial primes)  = nextPrimes (trial+2) primes
   |  otherwise               = trial:(nextPrimes (trial+2) (primes ++ [trial]))


primes :: (Integral a) => [a]
primes = 2:3:(nextPrimes 5 [3])


-- Return all the factors of a!
factor :: (Integral a) => a -> [a]
factor 1    = []
factor a    = p:(factor $ a `div` p)
   where p = firstPrime a


main :: IO ()
main = do
         print $ factor 90
         print $ factor 1234567891
         print $ factor 18991325453139
         print $ factor 18991325453143
         print $ factor 18991325453141
         print $ factor 12745267386521023


One comment I should make, lazy evaluation was essential!  Also, I am aware that (primes ++ [trial]) will do a complete list traversal of primes to append a single element.  Without that, the recursion was destroying the performance on the divides and firstPrime'.  In total, it's about 19 seconds to run.  Slower when more CPU is used...

Next off!  Computing the fibonacci sequence.  Once it hit me that I should try to avoid recursion (it's so natural) then I got this very nice and very fast solution:


import Control.Parallel


fib :: (Integral n) => n -> n
fib 1 = 1
fib 2 = 1
fib x = (fib $ x-1) + (fib $ x-2)


fibList' :: Integer -> Integer -> [Integer]
fibList' m1 m2 = m2:(fibList' (m2+m1) m1)


fibList :: [Integer]
fibList = (fibList' 1 1)


fib'' :: [Integer] -> Integer -> Integer
fib'' (x:xs) n
   | n <= 1    = x
   | otherwise = fib'' xs (n-1)


fib' :: Integer -> Integer
fib' = fib'' fibList


main :: IO ()
main = do
   print $ fib' 100
   print $ fib' 997
   print $ fib' 1000
   print $ fib' 1009
   print $ fib' 4500


I think for this 3rd problem, the challenge was to work with very large numbers efficiently.  Which Haskell does automagically.

Why infinite lists?  lazy evaluation is the answer!  I find it makes the algorithm cleaner even though a few things had to be shuffled around.  I did have fun typing fibList and seeing all the numbers in the fibonacci sequence appear.  And hopefully, written in this way it can cache intermediate results (but it runs so fast I don't think it even matters)!

Done a few programming challenges for the day!

Sunday, July 17, 2011

Branding the Perfect Image

Perfection.  The opposite of being human.  How interesting.

We accept nothing but perfection from the leaders of the world.  Some brands attempt to portray themselves as perfect, hiding faults to promote products (unless legally obliged).  And now branding seems to be entering the world of social media, with people participating in campaigns for some reward (or without realizing it).

As people get brought into this mess of branding / advertising, their language must be monitored.  Bad things may not be said (in the past nor the future as everything is recorded for infinity - easily searched).  Even a minor mistake, redacted, can spread like wildfire as it gets cached (remembered) within various minds, machines, and search engines.  Who will control what this legion of unwashed masses does to help promote a brand?

Why do I bring this up?  Some people use social media to socialize, voice honest opinions.  Others use the media to present an image of themselves.  The latter is useful for brands as they play the game of perfection. The former can get the message across but will not let hesitate to post an honest idea.  SLAPPs are probably the best vehicle to keep everyone in line.

What makes the social media useful are the honest opinions.  Those that call out defects, problems, lemons if you will.  Attempting to use this word of mouth to spread a fixed message appears to be restricted to the set of people willing to appear perfect.

People aren't perfect.

Sunday, July 10, 2011

In Search of Software Simplicity

15 years ago I was making small video-games as a hobbyist.  The games were simple, pushing QBasic to its limits.

Jump forward to today, 2011, and I'm still making video-games as a hobbyist.  The software development landscape has greatly changed.  Things look nicer, and the complexity of software development has jumped.  I try to keep up by abstracting away concepts.

The last concept I tried to abstract away was physics.  Of course there are plenty of physics engines out there, but I believe it is best that I understand how these work.  And more often than not, they are too powerful for what I wish to develop.

Even my simplistic set of objects are often times overkill for what I wish to do...

The last object I created, which I should have created a long time ago, is called "Angle".  It allows me to take and compare angles (even though they only exist between -PI and PI).  Something that makes my software more error prone and has greatly helped in reducing bugs.

Another object that I have is "Particle".  It's a point in space (2D or 3D) that has certain physical properties such as mass, velocity, etc.  A character is in essence a particle (it shares the same properties).  But, it can not go through anything.  So, should I infuse, within particle, all the potential logic to handle the case where it is within a bounding box that can collide with this specific world, or should I implement a custom type of object.

The arguments against the custom object are well known.  Code reuse.

But the arguments for creating a custom object are also strong:  Polluting a clean code-base for a one-time-use object is not worth it.  The particle object need not be that complex - leaving it simple means I can more easily reuse it when the time comes.

What if something else needs this code?  Then I'll worry about abstraction.  No use abstracting away something when I'm prototyping.  The goal isn't a library but something to test the feasibility.  The library is just a nice side-effect that happens when I try to keep track of the big picture.

Maybe code reuse is not the ultimate in software simplicity.

I'll leave this rant on one final note:  there once was an article (I believe the Daily WTF) which talked about the added efficiency of using the built-in functions to delete folders rather than recursively deleting folders.  Then it came to me, wouldn't it be more obvious if the right solution were to combine "fmap" with the "delete function" and a "file list"...

Complexity from combinations of simple elements, what a great thought!

Friday, July 1, 2011

My Latest Haskell Experiments

Many moons have risen in the night sky as I tried with great difficulty to become a Haskell master.  Lacking knowledge of category theory and Monads being this abstract notion with no concrete definition.  My infatuation with the language began with the desire to write a better programming language where I could express select computations and generate code to optimally handle a variety of data structures.  Haskell expressed purity in its functioning and strict type checking.  Winner it clearly was.

This year, marking at least the third, I have again revisited Haskell.  No language has managed to elude me before.  I will figure this one out.  This year, a book I have invested in.  A book I have read cover to cover. Another attempt at writing a code generator followed.

As I programmed Prolog, my mind always translated the code to it's sequential equivalent.  If this is done, then all possibilities are recursively enumerated until a solution is found.

Haskell, with Monads, is another story.  My problem was this silly attempt to maintain an imperative mindset rather than thinking functionally.  Hence, now I will document what I have discovered -- as many must have done before.  For someone wiser than I who can extrapolate the interesting portions and run forward -- someone who wishes to see how the thinking differs.  For tutorials, many have written much better texts than I could.

I'll start with the concept of laziness.  Haskell is lazy.  In an imperative language, suppose you had a for loop.  As long as it writes to a memory location (or potentially could write to a memory location) it will be run.  Haskell takes a goal and follows a chain of dependencies until it has finished the calculation at hand.  Back to our for loop, this means the compiler (or language) would need to provide a way to determine if a given write to a memory address will actually impact the resulting computation in the future.  This is possible for certain local computations on the stack, but anything dealing with shared objects is next to impossible.  Forget about static memory locations.

If we logically follow this path of thought, it means that Haskell has some restrictions on the language to allow it to do such reasoning.  Two things come to mind.  First global non-constant variables must go.  Suppose an object were to change a global variable, how would it know that no other object were to access that variable?  It becomes impossible with libraries unless everything must register for the data it will read from a-priori.

Actually, for the second thing, local variables as well should be constant.  Parameters passed to functions should be constant.  As should be the return values.  Essentially, all data becomes immutable.  Why do we want this?  All computations can now be represented as an acyclic graph.  You can start with a variable -- it is computed by calling a function, let's say.  That function's input is known, and the result is computed from information that is known.  So, let's say every computation that must be done for any value is known.  Then all that you need is to look at the graph of computations, only compute the nodes (that hold the operations) whose result benefits the desired computation at hand.

Some may wonder about determinism.  That is covered by the second thing.  If every variable is constant and can only change by creating a new constant whose value is computed by a series of predictable constant values then a function will always return the same output for a given input.

Implied by determinism and constant everything is the need for recursion.  Why recursion?  It creates more stack space to put constant values.  It also creates a framework to destroy values when they are no longer needed.

So that leads us to state...  how do we manage state?  Here, Haskell is very elegant.  Imagine you have an object that represents state.  Manipulating that object returns a new state object.  That old state object is discarded and the new one is used.  Haskell provides ways to automate this process.  This is the last piece of mind-bending...

Consider a multi-parameter function that returns a single value.  Suppose you gave a single parameter to the function and a new function was created with one-less parameter.  Let's call the first parameter the state and the return value is a combination of the state and some value (which we have yet to define).

Now, let's create functions that work on two inputs (the state and the data) and outputs two things (the state and the data).  The first function takes in a function, state, data, and outputs state and data.  Suppose the function to our function just manipulates data based on the current state.

A bit mind-bending, but now the state management is separated from the computations.  This doesn't always work, but when it does it's quite neat.  Error handling can be simplified in a similar way.  The error state can be a container that checks return values to make sure they are valid.  Push the idea further, since it's a function calling a function doing the actual work, the function that does the calling can first ensure that only non-error states can continue computations.  Want an exception state?  sure, have a function that takes two functions (the second only gets called on error).

Let's push this a bit further.  Our error functions last parameter is the data, but the first is the state.  Recall that we can assign values to the first parameters to create new functions?  Now, push this further and say assign a function with the promise to evaluate the results if successful.  In C this might look like: eval(func1, data1, eval(func2, data2, eval(func3, data3, ...)))...  Messy.   In Haskell: eval func1 data1 $ eval func2 data2 $ eval func3 data3.  Read it backwards...

Isn't that cleaner?  I'd need to get back to the book to give further insights as I'm too quickly jumping into Monads...  But, I think the point is made.

Haskell glues functions within functions (think containers) to abstract away computations.  Error handling can be seen as a series of functions.  As is state maintenance.  The typical machinery for this is a Monad.

Ok, quickly.  Monads.  Monads just say that a given object (some type of state) wraps another object (computation).  They define operators that must function for each Monad, and each Monad must conform to a set of rules so they behave as other programmers expect.

First, a Monad should be able to take a computation object and wrap it in a state object.  Second, given a function operating on the data, the Monad should be able to do the computations exactly as though they were done devoid of state.  Last, computations should give the same result no matter the order.  (a*b)*c = a*(b*c).

Well, something like that.  The concept, for a person used to the imperative style, should see a consistent use of the class adapter pattern.  The adaptee is the computation, the client is the state.

Saturday, April 23, 2011

Organic Game Creation

What makes a video-game fun?  Could it be that the construction process embeds itself within the gameplay?  A fun construction process yields a fun game.  A boring process yields more of the same (not that it isn't good, but not great).

For the past year, I've been trying to make games whose control scheme was easy.  Focusing on usability.  But that's not how I design games.  Design is such a harsh word filled with connotations which are negative in this discourse and my lines of reasoning, let's say "create".  If I were to paint a portrait, I would not design that much.  I would plan a general feel for the portrait, and let the portrait come to life through the process of adding lines.  Wouldn't it be silly to plan where the lines go and send it off to someone else?

The best games that I have created were the ones where I did not care about usability initially.  They were intended to be fun and challenge people.  Some were easy.  Others very difficult.  They each started as a sketch, an overall idea of how things were placed.  Then details got filled in gradually.  Adding features that I (egoistically) enjoyed, and removed others.

Like when I draw, I can never predict the final outcome of a game I create.  There is a sense of what it may look like, but the final version is always a surprise.  It took me at least a year to figure out this process which counters the current industry standard.

Now to push forward my prototypes and see which survive.  May the best be submitted to the App Store (and hopefully accepted) for all to enjoy.

I can not yet fully argue about process being reflected in gameplay, but I am more than certain now that it is the case.