Due: Tuesday Apr 15 before 9:00 pm
Objectives
- Practice constructing and using heap data structure as a priority queue ADT.
- Practice writing merge operation in skew and leftist heap data structures.
- Gain additional experience constructing and using binary trees.
- Practice using recursion in programs.
- Learn to use function pointers.
- Practice writing unit tests.
- Practice working in a Linux environment.
- Practice analyzing and understanding a project requirements.
Introduction
There is a very large number of messages posted on social media every day. The apps need to present the messages to the users in an organized way. Different social media apps take different strategies for presenting messages. A design team decided to test multiple algorithms for prioritizing the messages to be presented to a user. Your job is to implement a priority queue ADT for organizing the posts based on multiple priority algorithms. The design provides multiple implementations of a priority queue ADT such as different types of heap data structure. The type of heap can change at any time.
The application automatically prioritizes the posts. The prioritization is based on some specific criteria. The algorithm that uses the criteria to calculate the priorities can change. Therefore, the application should have the possibility of changing the post priorities. An algorithm that can determine the priority will be implemented in a function. Such an architecture allows us to use different priority functions.
Heap
Skew Heap
A skew heap is a specialized version of a heap data structure which performs the insertion and deletion operations in O(log n) amortized time. This data structure is a binary tree in which the root always holds the node with the highest priority. Skew heap uses merge operation to perform insertion and deletion.
The major operations supported by a skew heap are insertion of elements, reading the highest priority element, and removing the highest priority element. Reading the highest priority element is just a matter of reading the root node of the heap. The other two operations, insertion and removal, are applications of the merge function:
- To insert a new node x into an existing skew heap H, we treat x as a single-node skew heap and merge it with H.
- To remove the node with highest priority value, we delete the root node and then merge the root's left and right sub-heaps.
The special feature of a skew heap is the merge operation which combines two skew heaps into a single, valid skew heap. Let p1 and p2 be positions in two skew heaps (e.g. pointers to nodes). The merge operation is defined recursively:
- If p1 is Null, return p2; similarly, if p2 is Null, return p1.
- Assume that p1 has higher priority than p2; if not, swap, p1 and p2.
- Swap the left and right subtrees of p1.
- Recursively merge p2 and the left subtree of p1, replacing the left subtree of p1 with the result of the recursive merge.


Leftist Heap
A leftist heap is similar to skew heap and it uses a merge operation for insertion and deletion. However, it maintains a specific property and it uses the property during the merge operations. Every node in a leftist heap stores a value called Null Path Length (NPL). This value presents the lowest number of edges from the node to a null node. In the other word the NPL value represents the shortest possible path from a node to a null node.
During a merge operation if the NPL value of the right child is larger than the NPL value of the left child we swap the children. This generally makes the left subtree heavier than the right subtree.
Assignment
In this project, you will implement a priority queue class based on a skew-heap or a Leftist-heap data structure; it can maintain a min-heap or a max-heap based on the computed priority for every node, where the priority function is provided to the queue constructor via a function pointer. Inserting to and extracting from the skew/leftist heap uses a heap merge function which guarantees that the heap property (min-heap or max-heap) is maintained; the comparisons that are part of the merge process will be made on the computed priorities for the objects in the heap. The queue class allows for the priority function to be changed; in which case the heap must be rebuilt. Moreover, the data structure can switch between a skew heap or a leftist heap at the request of the class user. Such a request will trigger reconstruction of the heap.
For this project, you are provided with the following files:
- squeue.h – The interface for the Post, and SQueue classes.
- squeue.cpp – A skeleton for the implementation of the queue class.
- driver.cpp – A sample driver program. It contains sample use of the queue class.
- driver.txt – A sample output produced by driver.cpp
Specifications
There are two classes in this project. The class SQueue has a member variable of type Post.
Class Post
The implementation of this class is provided to you. You are not allowed to modify the class. This class stores the following information about a post:
- m_postID stores a unique ID for every post.
- m_likes stores the number likes for the post.
- m_connectLevel stores a number presenting the distance between the poster and the user who will see the post. The value 1 indicates that the poster and the viewer are directly connected. The value 2 indicates that there is third person between the poster and the viewer. There can be up to 5 connection levels.
- m_postTime stores the time that a message was posted. This variable can take a value between 1 and 50. It indicates the placement of post. Using this value we can see what message was posted earlier comparing to other messages. For example, the value 50 indicates that the message was the last one posted.
- m_interestLevel stores the level that a post may be in the interest of the viewer. The value of 1 presents the least and the value of 10 presents the most.
Priority Functions
A priority function is a user defined function that can use the information in the class Post to determine a priority value for a post. Two example functions are provided in the driver.cpp file. You can define your own priority functions. In fact, different plants may have different criteria for this purpose. The ability of using different functions allows us to adapt to different priority algorithms. The heap can accept a prioritization function. This is accomplished by passing a function pointer to the constructor. The function pointer is the address of a function that will be used to compute the priority for posts. The function must take an Post object as input and must return an integer priority value. A typedef for the function pointer is provided for you in the header file:
typedef int (*prifn_t)(const Post&);This says that prifn_t is a pointer to a function that takes a Post& argument.
Note: The implementation of this function is provided to you. You do not need to modify it.
Note: The implementation of this function is provided to you. You do not need to modify it.
Overloaded Insertion Function
There is an overloaded insertion function for the class Post to help you debugging the project. The implementation is provided to you. You do not need to modify the function.
Class SQueue
The class SQueue constructs a skew or a leftist data structure of the type min-heap or max-heap. This class has a member variable called m_heap. The member variable m_heap presents the root node of the heap data structure and it is of the type Post. The following table presents the list of member functions that need implementation.
Note: it is the responsibility of the user to pass compatible arguments priFn and heapType. The class implementation does not need to check the compatibility.
Note: rebuild means transferring nodes from the current data structure to the new data structure. Rebuild operation does not re-create the nodes.
Additional Requirements
- Private helper functions must be declared in the header file. No other modifications to the header file are permitted!
- No STL containers or additional libraries may be used in the implementation of the project classes. However, you can use STL containers in the Tester class for the testing purposes.
- The required functionality is provided in the Post class. There is no need for any modifications to the implementation of this class.
- The test program must use the provided Random class to generate test data.
- Your code should not have any memory leaks or memory errors.
- In the case of a leftist heap the lowest level of nodes which store the keys have zero NPL value.
- Computed priority values may not be pre-computed and stored with the Post object in the queue. They must be computed as needed using the priority function.
- Insertion to and extraction from the heap must run in amortized logarithmic time.
- Follow all coding standards as described on the C++ Coding Standards. In particular, indentations and meaningful comments are important.
Testing
- The test file name must be mytest.cpp; the file name must be in lower case, a file name like myTest.cpp is not acceptable.
- The test file must contain the declaration and implementation of your Tester class and the main() function as well as all your test cases, i.e. calls to your test functions.
- You are responsible for thoroughly testing your work before submission. The following section presents a non-exhaustive list of tests to perform on your implementation.
- You must write a separate function for every test case.
- Every test function must return true/false depending on passing or failing the test. Visual outputs are not accepted as test results.
- Tests cannot be interactive. The test file mytest.cpp must compile and run to completion.
- An example of declaring, implementing, and calling a test function, and outputting the test results was provided in the driver.cpp file of project 0.
- The testing guidelines page provides information that helps you to write more effective test cases.
Note: Testing incrementally makes finding bugs easier. Once you finish a function and it is testable, make sure it is working correctly.
Testing SQueue class
Note: The majority of tests need to check whether the heap property is satisfied after the operations. You might want to implement a helper function in your Tester class that performs such a functionality. Then, this helper can be called in multiple test functions.
- Test insertion for a normal case of min-heap. After a decent number of insertion (e.g. 300 nodes) we traverse the tree and check that the heap property is satisfied at every node.
- Test insertion for a normal case of max-heap. After a decent number of insertion (e.g. 300 nodes) we traverse the tree and check that the heap property is satisfied at every node.
- Test removal for a normal case of min-heap. After a decent number of insertion (e.g. 300 nodes) we check whether all removals happen in the correct order.
- Test removal for a normal case of max-heap. After a decent number of insertion (e.g. 300 nodes) we check whether all removals happen in the correct order.
- Test all nodes in a leftist heap have the correct NPL value.
- Test a leftist heap preserves the property of such a heap, i.e. at every node the NPL value of the left child is greater than or equal to the NPL value of the right child.
- Test whether after changing the priority function a correct heap is rebuilt with the same data (nodes) and the different priority function.
- Test merge of an empty queue (an edge case) with a normal queue. This is a call to the function SQueue::mergeWithQueue(SQueue& rhs) where rhs is a normally populated queue.
- Test the SQueue class copy constructor for a normal case.
- Test the SQueue class copy constructor for an edge case.
- Test the SQueue class assignment operator for a normal case.
- Test the SQueue class assignment operator for an edge case.
- Test that attempting to dequeue an empty queue throws an out_of_range exception.
- Test that attempting to merge queues with different priority functions throws a domain_error exception.
Random Numbers for Testing
For testing purposes, we need data. Data can be written as fixed values or can be generated randomly. Writing fixed data values might be a tedious work since we need a large amount of data points. You must use the provided Random class to generate test data.
In the file driver.cpp there is the class Random which generates pseudorandom numbers. The class is using a default seed value. On the same machine it always generates the same sequence of numbers. That is why the numbers are called pseudorandom numbers, they are not real random numbers. Please note, the numbers are machine dependent, therefore, the numbers you see in the sample file driver.txt might be different from the numbers your machine generates.
Memory leaks and errors
- Run your test program in valgrind; check that there are no memory leaks or errors.
Note: If valgrind finds memory errors, compile your code with the -g option to enable debugging support and then re-run valgrind with the -s and --track-origins=yes options. valgrind will show you the lines numbers where the errors are detected and can usually tell you which line is causing the error. - Never ignore warnings. They are a major source of errors in a program.
What to Submit
You must submit the following files to the proj3 directory.
- squeue.h
- squeue.cpp
- mytest.cpp (This file contains your Tester class, all test functions, all test cases, priority functions, and the main function.)
If you followed the instructions in the Project Submission page to set up your directories, you can submit your code using the following command:
cp squeue.h squeue.cpp mytest.cpp ~/cs341proj/proj3/
Grading Rubric
For details of grading rubric please refer to Project Grading Policy