As a software engineer, do you think your job requires the algorithm design and puzzle solving skills that your recruiter was looking for in your interview?
-
I haven't come across a situation in any project that I have worked on when I had to design a new tricky algorithm or when something didn't work out good enough on a project because I lacked good enough algo design or puzzle solving skills. It might be because the projects I have worked on are not challenging enough. Maybe for projects that are used on huge scale, developers constantly solve problems that require very good knowledge of algorithm designs and excellent analytical skills. If so, please can you share an instance during your work when you realised a good software engineer needs to possess these skills?
-
Answer:
Yes - I solve much more difficult problems working than I do in interviews. Over 20 years in industry I have not been asked about anything I haven't used for a job except math (Given a square dart board, is a randomly thrown dart more likely to land nearer the center or an edge?), logical reasoning (how many piano tuners are in Seattle?), and logic (why are man-hole covers round?) questions which were once popular and a few bit twidling questions (where the desired answer might be the arithmetic expression returning whether a number is a power of 2) questions people still use. Here are some examples. There are lots of duplicates I've omitted for brevity. Things like http://en.wikipedia.org/wiki/Graph_%28abstract_data_type%29, https://en.wikipedia.org/wiki/Finite-state_machine, and https://en.wikipedia.org/wiki/Concurrency_%28computer_science%29 get used for most interesting problems. https://en.wikipedia.org/wiki/Leslie_Lamport's happens-before relationship shows up in most distributed systems. I'm currently responsible for the data path in a startup doing enterprise online backup. Although we sell our product as http://en.wikipedia.org/wiki/Software_as_a_service I use a client side program to push updates to our cloud servers. I use a http://en.wikipedia.org/wiki/Reference_counting object for each file system object with a finite state machine controlling the synchronization which results in movement in and out of four http://en.wikipedia.org/wiki/Priority_queue . Its inputs come from its children plus two pairs of state machines - one pair for master copy, one pair for slave copy. Each pair has one for self state and one for children state. The objects are also stored in a http://en.wikipedia.org/wiki/Trie with http://en.wikipedia.org/wiki/Weak_reference back to their parents to allow objects to affect their ancestors' state machines. I use a http://en.wikipedia.org/wiki/Thread_pool_pattern for parallelism where threads can populate the tree from the master or slave copy, fill slave meta-data from a local cache I store as a file read and written sequentially, write the output cache and prune the tree, or perform other synchronization operations on filesystem objects. I limit in-memory tree size to manage the foot print and avoid paging with exceptions to avoid deadlock. Threads sleep on a http://en.wikipedia.org/wiki/Monitor_%28synchronization%29#Waiting_and_signaling when there's no work to do. On the cloud side I leverage Apache and its WebDAV module which incorrectly assumed its underlying embedded databases for dead properties and locking were multi-process and multi-thread safe (oops). I built a trivial key-value store for that using https://en.wikipedia.org/wiki/Optimistic_locking where operations for keys that weren't causally related to the last key state are rejected with that approach both performant and safe with very localized changes needed to Apache's WebDAV module. That includes indirection, data structures (trees and priority queues) in main memory, algorithms (traversing the tree to find work would be way worse than the priority queues), finite state machines, and concurrent programming which are usually covered in job interviews. At the previous startup building distributed object storage appliances I built the replication system. To support my data replication (simple master/slave is fine for a minimum sellable product) I implemented a distributed meta-data store / http://en.wikipedia.org/wiki/Distributed_lock_manager with both http://en.wikipedia.org/wiki/Eventual_consistency and http://en.wikipedia.org/wiki/Strong_consistency consistency modes. I used http://en.wikipedia.org/wiki/Paxos_%28computer_science%29for the strongly consistent mode. In the data path I fenced stale operations by http://en.wikipedia.org/wiki/Lamport_clock I did that using an http://en.wikipedia.org/wiki/Actor_model variant where I guaranteed message delivery order between each set of actors, approximated http://en.wikipedia.org/wiki/Closure_%28computer_science%29s where they could be bound to an actor's single thread or a thread pool for interfacing other subsystems, and allowed a closure to be applied on an RPC response (real from a response or synthetic from an epoch change, shutdown, etc.). I also invented an erasure coding scheme for https://en.wikipedia.org/wiki/Distributed_cache although we pivoted before I implemented it. I meant to get around to implementing state search in my simulated platform and model checker for validation but we pivoted before that (I did have pseudo-random testing with deterministic execution order with the varying timing and fault injection). That covers a lot of what you'd learn in an upper division distributed systems course and concurrency methods from Programming Languages other than the omnipresent shared memory thread model. Places like Amazon Web Services doing distributed systems ask questions about related topics interviews, and system software people are sometimes interested in event driven programming. At the startup before that building distributed block storage we were having problems with volumes going off-line due to timeouts (30 second hangs) and https://en.wikipedia.org/wiki/Bit_rot which I isolated to the meta-data system where changed entries were persisted in a journal and either discarded if undone (marking a page dirty then clean) or written to an on-disk array where the culprits were the random-IO flushes and never refreshing stable entries. I invented a circular log mixing new transactions with refresh entries written sequentially which guaranteed meta-data entries would be re-written elsewhere in the log before being over-written. That was up to 10,000 times faster than the existing system where the practical implications were no more timeouts and the ability to store a second copy (we used disk controllers with non-volatile write caches which would turn sequential IO streams intermixed with other IO into the corresponding sequential disc writes) with performance impact lower than the measurement noise floor even when deleting snapshots and changing lots of meta-data. At the first startup where I built distributed block storage I was the first engineer and had a lot more design latitude (but not much on product). The requirements I got were the same performance as direct attached I/O using commodity hardware, where the CEO did not consider a $300 disk controller or non-volatile memory board to be "commodity." I met the goals with good mean time to recovery by mating a https://en.wikipedia.org/wiki/Log_structured_file_system to a https://en.wikipedia.org/wiki/B%2B_tree/https://en.wikipedia.org/wiki/R_tree variant with reallocate on write for the disk block virtualization tables. I personally built the B+ tree code but delegated a lot of the LFS implementation so I could focus on distributed system issues and technical leadership. That covers data structures for secondary storage which sometimes come up in interviews for system software companies. If you want to do that sort of work you should note that each job was at a startup which begins with zero intellectual property and needs a lot of new development. You can even do that as a junior engineer at a startup because there can be more interesting work than senior people to fight over it - I hired my current minion straight out of school and have him working on finite automata among other things. Each position was also at a company where doing software better made the product work and sell more instead of being just a cost center. I've also worked with a few engineers lacking basic data structure and complexity knowledge with bad results. I tracked real-time issues in one job to a spot where one person used a sorted linked list (O(n^2)) instead of a heap (O(n log n)) for handling timed events and he could not understand what the problem was. The details have grown fuzzy with age, although I consulted for a company where I reduced something as bad as O(n^4) to O(n log n) which was not meeting its performance requirements - that VP of engineering thanked me for calling attention to the problems with his "resources" meaning staff. Don't be one of those people.
Drew Eckhardt at Quora Visit the source
Related Q & A:
- What kind of job will I have to do as a computer engineer?Best solution by Yahoo! Answers
- Can a computer engineer get a job easy?Best solution by Yahoo! Answers
- What exactly does a software engineer/software programmer do?Best solution by Quora
- What are the job scopes in Canada for a civil engineer? Will I get the job easily?Best solution by ask.shiksha.com
- What does a software engineer do?Best solution by tryengineering.org
Just Added Q & A:
- How many active mobile subscribers are there in China?Best solution by Quora
- How to find the right vacation?Best solution by bookit.com
- How To Make Your Own Primer?Best solution by thekrazycouponlady.com
- How do you get the domain & range?Best solution by ChaCha
- How do you open pop up blockers?Best solution by Yahoo! Answers
For every problem there is a solution! Proved by Solucija.
-
Got an issue and looking for advice?
-
Ask Solucija to search every corner of the Web for help.
-
Get workable solutions and helpful tips in a moment.
Just ask Solucija about an issue you face and immediately get a list of ready solutions, answers and tips from other Internet users. We always provide the most suitable and complete answer to your question at the top, along with a few good alternatives below.