Find the Fastest Sort Method with O(n*log(n)) Time Complexity

In summary: So if you have an array of 100 elements, and you want to sort it in descending order, the children of position 0 would be in position 0, the children of position 1 would be in position 1, and so on. The children of position N+1 would be in positions N+1, N+2, 2N, and so forth. That way, you would only have to copy the array once instead of making a new string for each assignment.
  • #1
Alkatran
Science Advisor
Homework Helper
959
0
I know that the fastest sort has O(n*log(n)) but I can't figure out what it is!

Here's what I have...

Use a tree structure: place the first value in the tree. If the next is less than the first, move the first to a child position and place the next in the main position. Otherwise just put the next in the child position. Basically repeat this for every item, with every Node having two children, always moving higher values downwards.

When all values are placed in the list, start taking them off.

remove element 1, and then check which of it's children are highest: move that child up and do the same computation for the now empty node: repeat until you hit a node with no children. Then remove element 1 again...

So... that comes out to:
O(n) items added * O(log[2](n)) sorting new items into heap
O(n) items removed * O(log[2](n)) sorting new items out of heap

That comes to:
O(2*log[2](n)*n)

Is that it? How do they eliminate the 2?
 
Computer science news on Phys.org
  • #2
Some sorting algorithms run in O(n) time! (Radix sort, bucket sort... ones not based on comparisons)

Anyways, remember that O(c f(n)) = O(f(n)) for any positive constant c...
 
  • #3
Isn't bucket sort another name for bubble sort? Complexity O(n^2).
Quicksort and treesort are the fastest maybe, of complexity O(nlogn).
 
  • #4
Radix sort and bucket sort is only O(n) in the best case scenario. It is more common to find it to be O(n log log n), which is better than quicksort which is O(n log n). The problem with radix and bucket sorts is that it is only good for small amount of data.

Here is an example where you want to use bucket or radix sorts:

4.3, 2.4, 6.4, 3.3, 7.5, 2.5, 5,3

You'll basically create a holding place (bucket if you will) for the range of integers: 2,3,4,5,6,7. Then you only go through the list in one pass dumping the numbers in their apporiate holding place. Once your done going through the list you concatanate all the buckets.

If each bucket only has one key when you are done going through the list then it is O(n), but in most cases you'll have multiple keys inside each bucket, therefore you have to sort each bucket that has more than one key. The more keys you have per bucket the moe ineffient the algorithm is. Qucksort doesn't degenerate as bad as bucket or radix sort when the key size gets large. For strings you'll get O(n^2) for bucket sort and O(n log n) for quicksort. You might as well use bubble sort O(n^2) if your going to use bucket sort for strings.
 
  • #5
Introsort, incidentally, is the fastest comparison based sorting algorithm. It runs quicksort unless it starts performing poorly, then switches over to heapsort.
 
  • #6
I like the idea of the bucket sort. But it is definitely limited because you have to define the range. It would be completely useless for double type variables.

In any case I wrote up my idea in visual basic. 45 seconds to sort 100 000 random strings (vary between 5 and 10 characters from A-Z) and 2 children per node. Increasing childs per node seems to have no noticeable effect on the end result. It only takes 2 seconds to sort 10 000 strings, which is amazing IMO.

Anyways, what is the difference between quicksort and heapsort? Heapsort is what I'm doing correct?
 
  • #7
What you're doing is almost heapsort. When you remove the root of the tree, the usual algorithm moves the last child node to the root, then sifts it down. It's slightly more efficient because you can keep the tree balanced this way.

Also, for additional speed, there's a clever way to store the tree as an array.
 
  • #8
That's how I used to store it, but I find this method much easier to understand (object oriented).

The idea is to store the value for a Node's children at the 2^(Node's position) (+1 for second child) position, correct?


I tried to sort a millikon strings, but I stopped once I realized it was taking up 200 megabytes of memory and rising.
 
  • #9
The problem is probably that it's making a new string every time you do an assignment. (I don't know VBASIC) One common solution to this problem is to maintain an array of pointers, and only shuffle arouund this auxilliary array.

The children of array position N would be 2N and 2N+1.
 

FAQ: Find the Fastest Sort Method with O(n*log(n)) Time Complexity

What does O(n*log(n)) time complexity mean?

O(n*log(n)) time complexity is a measure of the efficiency of an algorithm, specifically the speed at which it can process data. It means that the time it takes for the algorithm to run is proportional to the size of the data (n) multiplied by the logarithm of that size. This is considered an efficient time complexity as it is faster than O(n^2) but not as fast as O(n).

Why is it important to find the fastest sort method with O(n*log(n)) time complexity?

Finding the fastest sort method with O(n*log(n)) time complexity is important because it allows us to efficiently sort large amounts of data. As the size of the data increases, the difference in speed between O(n*log(n)) and slower time complexities becomes more significant. This is especially important in fields such as computer science and data analysis where sorting large datasets is a common task.

What are some common sort methods with O(n*log(n)) time complexity?

Some common sort methods with O(n*log(n)) time complexity include Merge Sort, Quick Sort, and Heap Sort. These methods use divide and conquer techniques to efficiently sort the data in a relatively short amount of time.

How do you determine which sort method is the fastest?

The speed of a sort method can be determined by analyzing its time complexity and comparing it to other sort methods. A method with a lower time complexity, such as O(n*log(n)), will generally be faster than a method with a higher time complexity, such as O(n^2). However, the actual speed of a sort method can also depend on factors such as the data being sorted and the implementation of the algorithm.

Are there any drawbacks to using a sort method with O(n*log(n)) time complexity?

One potential drawback of using a sort method with O(n*log(n)) time complexity is that it may require more memory than other methods with slower time complexities. This is because some methods, like Merge Sort, require additional space to store the data being sorted. Additionally, some sort methods may not be stable (maintaining the original order of equal elements) or may have a worst-case time complexity that is slower than O(n*log(n)).

Back
Top