Gtk3 vs HTML5

The last few weeks I’ve been working on an interesting new idea, hacking out a prototype.

The code is not really clean enough for public consumption yet, and a bunch of features are missing. However, its now at the stage where it can be demoed and evaluated.

I think the best way to introduce it is via a video: (original theora file)

[vimeo width=”763″ height=”512″]http://vimeo.com/17132064[/vimeo]

Basically, its a backend for Gtk+ 3 that renders in a browser.

A more techincal description for the web geeks among us:

Each toplevel window is mapped to a canvas element, and the content in the windows is updated by streaming commands over a multipart/x-mixed-replace XMLHttpRequest that uses gzip Content-Encoding to compress the data. Window data is pushed as region copies (for scrolling) and image diffs. Images are sent as data: uris of uncompressed png data.

Input is gathered via dom events and sent to the server using websockets.

Right now this is Firefox 4 only, but it could be made to work in any browser with websockets.

Now, I want to know, Is this useful?

There are two basic ways to use this, you can either run your own apps on your own server and access it from anywhere (kinda like screen). Or you can put it on a public server that spawns a new instance of the app for every user (gimp on a webpage!).

If you had this technology, what cool stuff would you do with it? What apps would you run, and how would you use them?

175 thoughts on “Gtk3 vs HTML5”

oliver says:

November 24, 2010 at 10:01 am

This is amazing. It means we can use the existing GTK applications and make them available in the browser. And it also means that the scope of the GTK framework has just increased a lot – we can use the same API for desktop and web!

Could you clarify about the technical background? Is every screen update sent as image, or is stuff like GtkLabel text actually rendered by the browser? How does it compare bandwidth-wise with VNC and X11?

Reply
mikeC says:

November 24, 2010 at 10:24 am

Would you consider replacing WebSockets with BOSH in order to this running on all the browsers that currently support the canvas tag? Or releasing the source so I could do it?

Reply
Stu says:

November 24, 2010 at 10:28 am

A cairo backend that worked like this would also be awesome (drawing goes to html canvas with javascript).

Another issue to solve is to make a filesystem and DND integration, then your basically there.

Reply
Julien says:

November 24, 2010 at 10:48 am

Awesome work dude !

We’d love to see this on github 🙂

cheers

Reply
bochecha says:

November 24, 2010 at 10:57 am

> “Now, I want to know, Is this useful?”

Are you kidding? Of course it’s useful!

Translators might be very happy about that:
http://trac.transifex.org/ticket/581

Reply
Dimitris Glezos says:

November 24, 2010 at 11:10 am

So awesome!

This could be used by translators of an application to preview their result and make sure the screen looks OK with the translated messages. In fact, a tool like Transifex (www.transifex.net) could show the preview for them.

Reply
Zeeshan Ali (Khattak) says:

November 24, 2010 at 11:37 am

I’ve been pondering if this is possible and you just proved that it is very much possible. 🙂 Being able to write applications for the desktop that works out of the box on the web would be the wet dream of many web developers out there who don’t really want to work on the web but have to do it.

Reply
esarbe says:

November 24, 2010 at 11:39 am

Nice! Awsome! Incredible! GTK+ just gained network transparency. That would be a boost for remote management, especially if the various distributions really start dropping X. And it doesn’t even require a specialized client application, just a browser. It would could even be useful for desktop-sharing and the remote help desk!

How does it perform?

Reply
Pingback: GTK+3.0跑在HTML 5上！——跟X说再见！ | liansi.org
Robin says:

November 24, 2010 at 11:51 am

Wow, you have just topped the list of my favourite hackers again! Previously on the first spot: Alexander Larsson :).

Reply
Ben Werdmuller says:

November 24, 2010 at 11:58 am

This is incredible stuff – really shows the capabilities of HTML5 (not to mention your own coding prowess!).

We’re getting to the point of having fully-featured apps, running on a server, accessed via a naked web browser with no plugins. That’s huge. Add some kind of decentralized file store, and you’ve got a properly decentralized networked application framework with almost limitless possibilities.

Reply
alexl says:

November 24, 2010 at 11:59 am

@simon:

It can handle anything that draws only using cairo. That includes WebKitGTK,
but not clutter (that uses OpenGL).

@jono chang:
I have not really measured the bandwidth yet, but my guess is that it does
pretty good on typical UIs, but its clearly not good for displaying video
or large fancy animations.

@oliver:

Technical details on rendering:
Not every drawing operation is sent over the wire, instead we send updates
after any expose event handling is finished. We also keep track of the
last image we sent to the client. So, when we get to update the image
we can compare the two images and send only the rectangles that were changed,
and additionally we send only the pixels that changed, the rest are sent as
black alpha=0 pixels, so these compress well. Additionally we do catch window
scrolling and send these as rect-list copies, so scrolling involves just a
single bitblit + the image for the newly scrolled in data.

roc:

What would help is a way to transfer image data that is more efficient than
base64 encoded data uris.

Jon Smirl:

I don’t see any reason for the primary way to render the buffer being html.
However, the same binary could easily allow both access via html and some more
efficient protocol for wayland buffer rendering.

@Havoc:

Firefox uses a fair chunk of X specific hacks in its gtk code… Would be
easier to run GtkWebKit in firefox! 🙂

@Vladimir:

While sending cairo commands (or some other rendering commandset) over the
link seems like a good idea I don’t think its actually the right approach.
Much of the UI in apps are rendered as pixmaps anyway, and there are several
layers of overdraw when rendering a full window, so i’m not sure there is less
data to sent. And, with the image data version its very easy to compare the
last and new frame and only send the difference, something which is very
hard to do in a full rendering api style protocol.

@Michael:

Good spotting, yes, keyboard input is one of the things left to do. Trivial
keyboard input should be easy, however there will probably be issues with
more complex input, as the browser ui steals many key combinations.

@Martin Sevior:

Its gtk+ 3.0 only, so you need to port abiword to do all rendering via cairo.

@Reeks:
Technically it might be possible to do OpenGL via webgl. However, its gonna
be hard to catch the gl calls and forward them. You’ll need a custom libgl.
Also, its unlikely to perform well if you’re using lots of textures. Could
work well for simple cases though.

@rms:

Well, emacs gtk+ port uses a lot of direct X calls, so it won’t work as is.
You’d have two alternatives to make it work:
* port all emacs rendering to “pure” tk+/cairo use
or
* Duplicate the work i’ve done for the backend as a separate emacs rendering
mode. Its really not that hard.

I think the second would be the easiest.

@lucasshrew:
@jyf:

Its not very unlike a vnc or an xserver, yeah. In fact I originally thought
to do an xserver implementation in the same way. However, in practice i think
it makes more sense to export an app, rather than a full desktop. The user
already has a desktop already.

@Luke Leighton:

Well, pygtkweb is “like gtk+”, which is only useful if you’re writing new
code for web use. This *is* gtk+, so any existing app runs with just a rebuild.

@murrayy:

Sure, its themable, but the themeing happens server side. We could have the
client tell the server what theme to pick though.

@Yann:

Sure, you can use any gtk+ binding. Its just a normal gtk+ app afterall.

@Pjvandehaar:

There are many ways to display html in a gtk app, the best one atm is GtkWebKit.

@Wingo:

No keyboard input yet, nor clipboard. However, simple keyboard input should
be easy to add (accelerators stolen by the browser window is tricky though).
I’m not sure if clipboard is doable. Need to look into how js+dom can modify
the clipboard.

@mikeC:

I don’t think a BOSH like approach will work. There are far to many events
going from the client to the server on e.g. mouse move for it to work with a
new http connection per message sent.

@Stu:

As i said above, i’m not sure forwarding cairo rendering commands is actually
more efficient.

Reply
Timo Juhani Lindfors says:

November 24, 2010 at 12:14 pm

Sounds very cool indeed. At least at http://cofundos.org/project.php?id=67 people have been waiting for something like that. How invasive changes did you need to do? (I’m assuming small changes all over GTK and then some thread to handle http?)

Reply
1. alexl says:
  
  November 24, 2010 at 12:18 pm
  
  @Timo:
  
  Its a new gdk backend. No gtk or app changes are necessary. I/O is done on the mainloop, not a thread. So your app better not starve that.
  
  Reply
Jon Wood says:

November 24, 2010 at 12:22 pm

I’m split on this. As a geek, the elegance and ingenuity is something that puts a big smile on my face.

As a web developer, it slightly concerns me. For certain specialised purposes I can see this being somewhat useful – esarbe mentions using it as a replacement for X remoting, which would be awesome. I like the idea of being able to point a web browser at my workstation.

My concern is Zeeshan’s comment “web developers out there who don’t really want to work on the web but have to do it”. There are specific reasons why writing software for the web is as it is, not least accessibility, and replacing HTML, CSS and Javascript with a big canvas tag is not a good thing.

Reply
1. alexl says:
  
  November 24, 2010 at 12:30 pm
  
  Jon Wood:
  
  Additionally it hides the source from the web browser, which is very un-web:esque.
  So, I don’t hope this will replace the web as is, but complement it.
  
  Reply
Andrew Theken says:

November 24, 2010 at 12:33 pm

This is fantastic. How much bandwidth is required to sustain an app session (I know it will vary dramatically)?

Reply
Jon Smirl says:

November 24, 2010 at 12:55 pm

@alexl:

Keep improving this. There are lots of ways to make it faster. For example cache images in the client, compute deltas to the screen DOM and send them as JSON, build client side widgets (text editor, dialogs, menus) and replace GTK at the widget level instead of drawing level. Browsers have WebGL now so you can remote GL apps. Search for the Google Quake in Chrome demo.

We can start with two paint functions, normal and this one. But over time you’d like to move to only the HTML5 one and optimize the app to use it. The high level concept here is that HTML5 is the new toolkit, GTK/HTML5 is a transition tool to this end. As apps under go this transition they will evolve their UI to become better optimized on the HTML5 toolkit.

Making this transition has major impacts. It makes every Linux app network transparent to any OS. It replaces X transparency. It is a great solution for people using VMs. HTML5 standardizes everything. HTML5 is themable via CSS.

Don’t worry about using a socket to the local toolkit. That process is very fast. You are using it today in the X server. The X server is the identical model as using HTML5. xlib packages your drawing requests and sends them over a socket to the xserver which draws them. You are generating HTML5 requests and sending them over a socket to a HTML5 toolkit which will render them into Wayland. But the HTML5 is much more efficient since it is higher level and it is standardized.

Reply
Jon Smirl says:

November 24, 2010 at 1:17 pm

Video can be handled by setting up a separate stream from the HTML5 front end. App asks for a video widget, you make an HTML5 widget, HTML5 widget asks the server for a video stream, server redirect the video stream from the client to the HTML5 engine. You will have to assume the the HTML5 client has a codec that can decode the stream.

This is part of changing the apps. You don’t want the apps doing their own audio and video decompression. You need to send the compressed stream to the front end.

Another example: think of Google Docs if they gave us the source code and let us run the server on our local machines. Google Docs UI is going to greatly improve once it is converted to HTML5.

GL textures can be sent across the wire and be cached in the remote GL engine. People should be storing these in GPU memory instead of system RAM anyway. I am always running out of system RAM while my GPU memory sits there empty.

Reply
pancake says:

November 24, 2010 at 1:49 pm

Do you know blitzen and maja projects?

blitzen is a application server based on GObject which uses Stk, a gtk-like library designed to render into native HTML elements. I think both projects can benefit on this.

maja is a vala-to-javascript compiler.

Both projects can be used together in order to get something like GWT, but much more gtk-friendly, vala based and more free-software like.

Good work!

Reply
Glenn says:

November 24, 2010 at 4:09 pm

I’ve always had a very serious need to use Gimp on public (school) terminals. Maybe this will eventually work for that.

Reply
Pingback: Links 24/11/2010: Avatar Reveals Reliance on GNU/Linux, Acer Distributes More Android | Techrights
Pingback: Tweets that mention Gtk3 vs HTML5 « Alexander Larsson -- Topsy.com
Pingback: Gtk3 vs HTML5 | liansi.org
Alon says:

November 24, 2010 at 6:02 pm

Hi Alex,

Very cool. Will you be doing a backend in spice next maybe?

Alon

Reply
1. alexl says:
  
  November 24, 2010 at 6:18 pm
  
  @alon:
  
  Heh, i didn’t plan to, but its clearly doable.
  
  Reply
behdad says:

November 24, 2010 at 6:36 pm

Absolutely amazing.

– Is a pulseaudio backend also coming? 😉 I’d love to open my rhythmbox instance running on my home computer and stream music on my laptop / iphone.

– GtkPrinting backend would be nice too :).

– Buildbots can now let you start the built app and play around.

– Selection is a major issue, but it can be fixed easily I’m sure.

Reply
CaptianObvious says:

November 24, 2010 at 7:49 pm

Very cool!

But what about security? The gtk+ program can do whatever it likes on the server it runs on?

Reply
Pingback: Gtk3 vs HTML crosspost - D0znpp blog
Pirvu says:

November 24, 2010 at 11:17 pm

Wow …

Reply
teadict says:

November 25, 2010 at 12:15 am

This will change the civilization as we know it!

Reply
Pingback: 404 Not Found
Pingback: GTK3 научили отображаться в веб-браузерах « Verlinks
nuclight says:

November 25, 2010 at 9:24 am

This thing exists for years on Qt, it is known as Vedga (former Glan), see english part of http://kalpa.ru. Just not in a browser: thin universal client speaking gzip-compressed protocol, and QT app runs on server, just it’s screen is local. Very fast, works even on a modem connection.

Reply
Pingback: Представлен бэкенд для формирования вывода Gtk+ через web-браузер
MrJuren says:

November 26, 2010 at 1:29 am

that’s cool~
when code can integration work.
the next problem is internet speed.

Reply
jcadam says:

November 26, 2010 at 4:29 am

A remote QT is totally different with a gtk app run on the web! You can not integrate legacy QT app to next generation web apps. However, Alex finds a way to get gtk worked. I guess that’s why people like it.

Reply
string says:

November 26, 2010 at 5:13 am

When would the code be released, it may get more enhancement from the community.

Reply
Salwar kameez says:

November 26, 2010 at 5:28 am

WOW… Awesome…

Reply
Burhan KILINC says:

November 26, 2010 at 8:00 am

It can be good for using GnuCash when outside of the office.

Reply
Pingback: HTML 5 Canvas: the only plugin you need? « Tim Anderson’s ITWriting
Pingback: Gtk+ 3.0 html5 backend « Alexander Larsson
dora says:

November 26, 2010 at 6:23 pm

Uhh? isnt this just an Ajax VNC? open google and search for ajax VNC. You dont really need canvas, just an IMG tag will do.

Reply
another_sam says:

November 26, 2010 at 9:10 pm

So, is Gtk+ now available for Maemo, Windows Mobile, Android, and whichever thing able to run Firefox?

Reply
oiaohm says:

November 26, 2010 at 9:57 pm

I will state something key. Don’t depend on webgl. GLX with X11 fails across network due to the huge size and amount of traffic opengl can send backwards and forwards. virtualgl would be a good place to start as well as the method wayland uses for rendering.

Really GTK needs a way to tag Opengl that can be sent to client and opengl that need to be processed server side.

Interesting would be seeing integration with the likes of eyeos. So we can have like a full desktop in the webbrowser.

Reply
Pingback: ¿Aplicaciones de Escritorio dentro del Navegador Web?
Pingback: برامج جنوم 3 قد تعمل ضمن برامج التصفح
Pingback: » GTK3 跑在浏览器里 Wow! Ubuntu / Ubuntu 及 Linux 新闻、技巧、软件及游戏！
Justin C. says:

November 27, 2010 at 10:13 am

This proof of concept is amazingly terrific! Great job.

With that said, does this project not scare the hell out of anyone else? With malware, phishing, and arbitrary-code-execution exploits rampant (mostly on Windows thankfully), what’s to stop criminals from embedding Gtk+ apps in webpages that phish for my passwords or worse.

I can’t image it’s very hard to recreate the gnome-keyring or gksu window and present it to unsuspecting users through html5 websockets.

Web-browsers are capable of connecting to multiple domains. If each domain uses this gtk interface, how do I determine which domain that login popup belongs to? I don’t want to login into my “gtk facebook” page only to realize later that login box was from a malicious domain I happened across.

With a second look, after that knee-jerk reaction, it’s clear that this technology is confined within the canvas object of the web browser, which will eliminate arbitrary-code-execution and malware as security concerns, but still leaves phishing.

Assuming that the address bar of your web browser cannot be hidden, or the title bar text changed (in such a way that you can’t tell the window belongs to your web browser), then popup windows with gtk interfaces can be distinguished as local or internet-originating, and/or their domains identified. This assumes, however, that all browsers meet the minimum requirements and users are vigilant.

One aspect that may generate confusion and become a potential vulnerability, is focus indicators. I assume that the gtk app on the remote client will still display the caret in it’s text box after I’ve moved focus to another html5 object (perhaps a text box). Now my screen is “indicating” that keyboard input can go to one of two places. And this will get worse if more remote apps are visible on screen at the same time. Are canvas focus events (enter/leave) sent to the remote-gtk app, to handle focus indicators?

Again, Alex, great work. I can see the potential applications for this, but only for private use on private networks. I hope my concerns for security are taken for what they are–my opinions–but lead you to greater thoughts for making this technology secure enough for use as a public standard on public networks. I would caution against relying solely on security provided by web browsers (the “let someone else handle it” mentality). Not all browsers are equal in this regard.

Good luck!

Reply
Marco Trevisan (Treviño) says:

November 27, 2010 at 11:55 am

Cool implementation, I was waiting for this!

Just for information, what mono-space font are you using in Gedit? It’s so nice! 🙂

Reply
1. alexl says:
  
  November 27, 2010 at 6:39 pm
  
  @marco:
  The font is “Envy Code R” (http://damieng.com/blog/2008/05/26/envy-code-r-preview-7-coding-font-released)
  
  Reply
Pingback: GTK3程序跑在浏览器里 | IT News - 发布最新IT信息
Francesco says:

November 28, 2010 at 1:18 am

What about multiple connection? is there always one thread?
thanks for your work

Reply
Richard says:

November 28, 2010 at 3:56 am

(didn’t read all 151 other comments)

It would at least be nice to demo applications before installing them.

Also, using them, or having your own desktop available to a web browser (“Security!”)

Reply

175 thoughts on “Gtk3 vs HTML5”

Leave a Reply Cancel reply