GENSCAN 1.0	Date run: 5-Nov-116	Time: 11:19:30

Sequence gi568815585r:36070531_36314526 : 243996 bp : 38.42% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   9658   9748   91  0  1   28  100   120 0.736   7.90
 1.02 Intr +  18641  18719   79  0  1   49   66    73 0.342  -1.11
 1.03 Intr +  24145  24276  132  1  0   58   95   116 0.411   8.14
 1.04 Intr +  28804  28960  157  1  1   23   73    66 0.152  -2.31
 1.05 Term +  32057  32215  159  1  0   54   50    96 0.067  -0.64
 1.06 PlyA +  33045  33050    6                               1.05

 2.12 PlyA -  33357  33352    6                               1.05
 2.11 Term -  33838  33773   66  1  0   98   42    52 0.207  -1.44
 2.10 Intr -  41685  41339  347  1  2   93  110   381 0.984  34.99
 2.09 Intr -  55613  55232  382  2  1  -24  105   472 0.047  31.26
 2.08 Intr -  56538  56407  132  0  0  105   47    71 0.061   4.62
 2.07 Intr -  57993  57852  142  2  1  120  -10   130 0.115   5.83
 2.06 Intr -  58084  58050   35  1  2  119   76     2 0.851  -1.80
 2.05 Intr -  58595  58510   86  0  2   82   87    31 0.781   1.02
 2.04 Intr -  60093  59863  231  1  0   30   89   178 0.401   8.92
 2.03 Intr -  60658  60584   75  2  0   40   90   126 0.163   6.57
 2.02 Intr -  80040  80010   31  0  1   85   78    25 0.018  -2.01
 2.01 Init -  81563  81429  135  2  0   65   81    65 0.317   3.69
 2.00 Prom -  82093  82054   40                              -5.25

 3.00 Prom +  82383  82422   40                              -6.95
 3.01 Init +  83303  83354   52  1  1   74   93    43 0.934   4.77
 3.02 Intr +  84263  84436  174  0  0   65   60    67 0.590   0.59
 3.03 Intr +  86346  86566  221  1  2  102   61   137 0.556   9.50
 3.04 Term +  92302  92523  222  0  0   59   43    83 0.069  -3.17
 3.05 PlyA +  92643  92648    6                               1.05

 4.10 PlyA -  94059  94054    6                               1.05
 4.09 Term -  99812  99783   30  2  0   91   48    35 0.510  -3.32
 4.08 Intr - 100257 100001  257  1  2   71  119   134 0.959  11.04
 4.07 Intr - 103280 103162  119  1  2   74   83   107 0.978   7.99
 4.06 Intr - 104037 103946   92  0  2   80   60    57 0.606  -0.03
 4.05 Intr - 104339 104192  148  1  1   64   68    91 0.373   4.02
 4.04 Intr - 121364 121265  100  0  1   55   53   134 0.234   4.85
 4.03 Intr - 123337 123276   62  0  2  110   95   -29 0.062  -2.54
 4.02 Intr - 131563 131349  215  1  2  141   65   258 0.423  25.39
 4.01 Init - 138429 138253  177  0  0   61   54    74 0.590   0.61
 4.00 Prom - 139042 139003   40                              -5.55

 5.08 PlyA - 139487 139482    6                               1.05
 5.07 Term - 144348 144129  220  2  1  113   49   181 0.769  12.23
 5.06 Intr - 145279 145161  119  1  2   76  -26   123 0.666  -1.96
 5.05 Intr - 161191 160874  318  1  0   72   66    87 0.056   0.13
 5.04 Intr - 164123 164064   60  2  0  108   70    66 0.674   4.81
 5.03 Intr - 183326 183273   54  2  0   96   93    82 0.991   7.76
 5.02 Intr - 183613 183515   99  1  0   61   76   121 0.992   7.59
 5.01 Init - 185542 185333  210  1  0   73   23   201 0.427  10.93
 5.00 Prom - 185781 185742   40                              -8.05

 6.00 Prom + 185910 185949   40                              -9.55
 6.01 Init + 185960 186046   87  1  0   84   35    49 0.108  -0.11
 6.02 Term + 188544 188855  312  2  0   41   47   286 0.497  13.82
 6.03 PlyA + 191961 191966    6                               1.05

 7.00 Prom + 192841 192880   40                              -6.95
 7.01 Sngl + 199147 199497  351  0  0   46   43   256 0.989  12.70
 7.02 PlyA + 200078 200083    6                               1.05

 8.08 PlyA - 201910 201905    6                              -0.45
 8.07 Term - 202433 202306  128  0  2   46   49   141 0.337   3.46
 8.06 Intr - 213172 213062  111  2  0   96   87    88 0.991   8.93
 8.05 Intr - 215570 215495   76  2  1    6   87    92 0.789  -0.93
 8.04 Intr - 216277 216206   72  1  0  111   92    -5 0.517   0.88
 8.03 Intr - 218832 218698  135  0  0   44   36   106 0.539   0.84
 8.02 Intr - 220335 220251   85  2  1   63   66    93 0.846   3.50
 8.01 Init - 227189 227107   83  2  2   73   81   173 0.969  15.59
 8.00 Prom - 233021 232982   40                              -4.95

 9.05 PlyA - 233210 233205    6                               1.05
 9.04 Term - 234102 233835  268  2  1   73   45   348 0.998  22.98
 9.03 Intr - 241705 241615   91  2  1   56  108    53 0.866   2.23
 9.02 Intr - 241947 241789  159  1  0   34   87   157 0.768   9.24
 9.01 Intr - 243891 243697  195  1  0   35   38   246 0.769  12.76

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  57993  57780  214  2  1  120   42   118 0.814   6.12
S.002 Sngl + 117007 117315  309  0  0   83   38   168 0.807   6.95

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815585r:36070531_36314526|GENSCAN_predicted_peptide_1|205_aa
MKKEGEKGEQGLACADNQCSGGGHKKGKPKASSMGLISAPFRIILPLYNVTYLRINIGSR
AISREALYILKMYCSGQNDEEPWRASAVKEETISGQEYDFLYLKYEGNVTVFRQSHFQTV
QGHPQIRLTVSIISYSHEHTNVKMQVSLVMKLAGGTHKDHGDCAPIDLKSLLPGMYKTRL
DLWNQTRFEKCDFFSKRILDKADGV

>gi568815585r:36070531_36314526|GENSCAN_predicted_CDS_1|618_bp
atgaagaaggaaggtgagaaaggagagcaaggcctagcttgtgcggataaccagtgttcg
ggtggagggcataagaaaggaaagccgaaagcgtcttcaatgggcctgatttctgcccca
tttcgaatcattcttcctctgtacaatgtcacatatctgcgaataaatattggctccaga
gctatcagccgggaagcactttacattttgaaaatgtattgctcaggacaaaatgatgaa
gagccctggagagcatctgctgtgaaagaagaaacaatttcgggccaggaatatgatttc
ctttatttaaagtacgaaggcaatgttacagtgtttcggcagtcacattttcaaacagta
caaggacatcctcaaataagattaactgtatcaatcatctcatactcccatgaacataca
aatgtgaaaatgcaagtttctctagtaatgaaacttgcaggaggaactcataaagatcat
ggtgactgtgctcccattgacctaaaaagcctgttacctggcatgtacaaaacaagactg
gacttatggaatcagacacgttttgaaaaatgtgactttttttctaagaggattctagac
aaagcagatggagtctga

>gi568815585r:36070531_36314526|GENSCAN_predicted_peptide_2|553_aa
MIRLSPTGYLSQHMGIMGATIQDEIWVGTQPDHIIRVVQETPWGVVPMSETAGSNARRGQ
HQEAAPSAATKTPGGVSPRTAWVLQVRLQGLKFLQGQRGCLMFKLVPAVLKACGEEMAFF
PPSGSARTAPQCPTKEAAFENQVCAELLLDLARRCLVVYINLEGNPQTPSVGRMITAFKN
KTQTSKTFSGVSNRPISRRGLDQLCGLSKVTELVRSEGLQNPASHAEEEPVSKEQGMENT
SFGVTPTIATSLSLYDDMRAYFLPATRISPLPCKEQWALESGKPGTELLSIMSFGRDMEL
EHFDERDKAQRYSRGSRVNGLPSPTHSAHCSFYRTRTLQTLSSEKKAKKVRFYRNGDRYF
KGIVYAISPDRFRSFEALLADLTRTLSDNVNLPQGVRTIYTIDGLKKISSLDQLVEGESY
VCGSIEPFKKLEYTKNVNPNWSVNVKTTSASRAVSSLATAKGSPSEVRENKDFIRPKLVT
IIRSGVKPRKAVRILLNKKTAHSFEQVLTDITDAIKLDSGVVKRLYTLDGKQDEAKKGVL
LAFESPELDLVND

>gi568815585r:36070531_36314526|GENSCAN_predicted_CDS_2|1662_bp
atgattcgattatctcccactggatacctctcacaacacatgggaattatgggagctaca
attcaagatgagatttgggtggggacacagccagaccatatcatcagggttgttcaggag
actccttggggagttgtacccatgagtgagactgctggatcaaatgcccggcgaggacag
caccaggaggcggcccccagcgcggccacaaagacccccggcggcgtctctccgcggacc
gcgtgggtgttgcaagtacgtttgcagggtctcaagttccttcagggacagcgaggctgc
ctcatgttcaaattagtacctgcagtgctaaaagcttgcggagaggaaatggcatttttt
cctcctagtgggagcgcacgcacagccccgcaatgccccacaaaagaggctgcttttgaa
aaccaggtttgcgcggaattgctcctggatttggcaaggagatgcttggtcgtttatatc
aatttggaaggaaatccccaaactcctagtgttgggagaatgatcactgcctttaagaat
aaaacacaaacttcaaaaaccttcagtggggtcagcaatagacccataagcagacgaggt
ctggatcagttatgtggcctgtccaaggtcactgagctagttaggagtgagggcttacaa
aatcctgcttctcatgctgaagaggaaccagtttcaaaggagcaagggatggaaaacacc
agctttggagttacacccaccatagctacttctctttcactgtatgatgatatgagagct
tattttcttcctgctaccaggattagccctctgccctgcaaggagcagtgggctttggag
tcagggaaacctggaacagagttgctgtccatcatgtccttcggcagagacatggagctg
gagcacttcgacgagcgggataaggcgcagagatacagccgagggtcgcgggtgaacggc
ctgccgagcccgacgcacagcgcccactgcagcttctaccgcacccgcacgctgcagacg
ctcagctccgagaagaaggccaagaaagttcgtttctatcgaaacggagatcgatacttc
aaagggattgtgtatgccatctccccagaccggttccgatcttttgaggccctgctggct
gatttgacccgaactctgtcggataacgtgaatttgccccagggagtgagaacaatctac
accattgatgggctcaagaagatttccagcctggaccaactggtggaaggagagagttat
gtatgtggctccatagagcccttcaagaaactggagtacaccaagaatgtgaaccccaac
tggtcggtgaacgtcaagaccacctcggcttctcgggcagtgtcttcactggccactgcc
aaaggaagcccttcagaggtgcgagagaataaggatttcattcggcccaagctggtcacc
atcatcagaagtggcgtgaagccacggaaagctgtcaggattctgctgaacaagaaaacg
gctcattcctttgagcaggtcctcaccgatatcaccgatgccatcaagctggactcggga
gtggtgaaacgcctgtacacgttggatgggaaacaggatgaggctaagaaaggtgttcta
ttggcctttgagtctccagagcttgacctggtaaatgactga

>gi568815585r:36070531_36314526|GENSCAN_predicted_peptide_3|222_aa
MNKQGKKMETTPKTNGEDAKAVPERNFQALAHRLRVNARSRLPRGGWRRLLITSHLRVTA
EAGEARGPAVRRKGRGSDPLNRKEKTFTLPLARNLLLLQEGPPENRGPITLVGKHQQTSQ
TLLIPFPSSSLAPLFDDSLLKWMPVEMVKPLSSLSDALGETPVKFLLIQQQQPGSQICLE
KIPAAPRNQNPSLFRMLQSQEEWYEVTLRTIFLPLCRSCIGE

>gi568815585r:36070531_36314526|GENSCAN_predicted_CDS_3|669_bp
atgaacaaacaagggaagaagatggagaccactcccaagaccaatggggaagatgcaaaa
gcagtccctgaaaggaatttccaggcgctggctcacaggctgcgtgtaaacgcccggtca
cgcctcccgcgtggaggatggagacgattactaatcaccagccacctccgagtgacagca
gaggcaggggaggctcggggcccagctgtcaggcgcaagggcagaggttctgatccacta
aatagaaaggaaaagacctttacactccctttagcaagaaatcttctgctcctgcaggaa
ggtccacctgaaaacagagggcccataacactcgttgggaagcaccaacagacatcccag
actctcctcatcccatttcctagtagctccctggcaccactgtttgatgacagcttgttg
aaatggatgcctgttgaaatggtgaagccattatcttctctgtcagacgcacttggtgaa
accccagttaaatttctgcttatacaacagcagcagccaggcagtcaaatttgtctggag
aaaatcccagcagcccctaggaaccaaaacccctccttgtttaggatgctacagagccag
gaagagtggtatgaagtgacattaagaaccatttttcttcctctgtgcagatcgtgtatt
ggggagtag

>gi568815585r:36070531_36314526|GENSCAN_predicted_peptide_4|399_aa
MQRKKRRKASKGTKQLILFISNLDSRRKIQRQSDFDLISICWVKLALNCESTVKMFSDLA
KIDILLVGDVTVGYLADTVQKLFANIAEVTITISDTKEAAALLDDCIFNMVLLKVPSSLS
AEELEAIKLIRFGKKKNTHSLFVFIIPENFKVKTENATGPEELGLPLQRSYSEHLGYFPT
DLFAWERIKYCCEQLRTLLPYVKGRKNDAASVLEATVDYVKYIREKISPAVMAQITEALQ
SNMRFCKKQQTPIELSLPGTVMAQRENSVMSTYSPERGLQFLTNTCWNGCSTPDAESSLD
EAVRVPSSSASENAIGDPYKTHISSAALSLNSLHTVRYYSKVTPSYDATAVTNQNISIHL
PSAMPPVSKLLPRHCTSGLGQTCTTHPNCLGWCLDIQNQ

>gi568815585r:36070531_36314526|GENSCAN_predicted_CDS_4|1200_bp
atgcagaggaaaaagaggagaaaagcatccaaaggaacgaagcagcttattcttttcatt
agcaacttggattccagaaggaaaattcagagacaaagtgattttgaccttataagtatt
tgctgggtaaaacttgcactgaattgtgagagcacagtgaagatgttttcagacttggca
aaaatagacatcttattagttggagatgtcactgtgggctacctggctgatactgtacag
aaactatttgcaaacatagcagaagtcaccatcaccatcagtgacacgaaggaggcagca
gcgcttttggatgattgcatattcaacatggttctcttgaaggtgccttcttcactaagt
gccgaggagctggaagccatcaagttaattagatttggcaaaaagaaaaatacacattca
ctgtttgtttttataatccctgaaaattttaaagttaagactgaaaacgcaactgggcct
gaagaacttggattgcccctgcagaggtcctacagcgaacacctgggatattttcctact
gatctatttgcctgggaaagaatcaaatattgctgtgagcagctgcgtactctcttgccg
tatgtaaaagggagaaagaatgatgcggcttcagttcttgaggcaacagttgattatgtg
aaatatatccgggagaaaatctctccagccgttatggcccagattacagaagcacttcag
agcaacatgaggttttgtaagaaacaacaaacacccattgagctgtctctcccaggcact
gtcatggcacagcgggaaaacagtgtgatgagcacttactcccctgagagagggctccaa
ttcctgactaatacgtgctggaatgggtgctccactcctgatgcagagagctccttggat
gaagctgtgagagttccatcaagctccgcctcagagaatgctattggtgatccatataaa
actcacatttccagtgcagcgctgtctctgaattccttgcatactgtcagatattattct
aaagtcaccccttcctacgatgcaactgctgtaacaaatcagaacatttcaattcattta
ccttcagccatgcccccggtctcaaagcttctccctcggcactgcacttctgggttgggc
cagacgtgcactacacatcccaactgtctgggctggtgtctcgatatccagaaccagtga

>gi568815585r:36070531_36314526|GENSCAN_predicted_peptide_5|359_aa
MEATLAGKEGEGKAGAGQDRNETAHMNIGVGQLTSLAHTVLSALHLVKGGKRLRGGRGLG
SEWGLGKQDIESLNTLLKQLEEEKKTLESQVKYYALKLEQESKAYQKINNERRTYLAEMS
QAFIVSDEKSVVTPIEDPFYRKVFRKLKWPLTLSSQSVRKVGFLVPHKSLMEKGSHGNLR
KGIHGSVLSSLDRTFHIRAPDMLLPAFELPALKLYASVAVEMMCYNRRLPKICTVSRNYS
QEGFRVKNCRVSQLYFCMPPGSKSDPQDTAPAAQEPGDLPRDRLQRWRRSGSEYAGLSVV
RAALGRGLGVRSQLVTRSPKGPQDSLSSPRGQAPDPGSALTPQSGLLLPHAPELATGPL

>gi568815585r:36070531_36314526|GENSCAN_predicted_CDS_5|1080_bp
atggaagcaacgcttgcaggcaaagaaggagaaggaaaggctggagcaggccaagacagg
aatgaaacagcccacatgaatattggagtgggacaactgacctccctggcccacaccgtg
ctgagcgccctgcatctggtcaagggtggtaagagactacgtggaggacgaggacttggc
agtgaatggggattggggaagcaggacatagaatccttaaacacattacttaaacagcta
gaagaagaaaagaagactcttgaaagtcaagtgaaatactatgcacttaaactggaacaa
gaatcaaaggcttaccagaagatcaacaatgaacgccgtacatacctagctgaaatgtct
caggccttcatagtttctgatgagaaatcagttgttactcctattgaagatcccttttac
agaaaagtttttaggaaattgaagtggcctttaactctctcaagtcagagtgtaaggaag
gttggctttttagtgccacataagtcattaatggaaaaaggatctcatggaaatctaaga
aaaggaattcacggatctgtcttatcctcattagacagaactttccacattagggcacct
gacatgctgcttccagcctttgaattacctgcccttaagctgtatgcctctgtagcggtg
gagatgatgtgctataacagaaggctgcctaagatttgcactgtaagcagaaactattca
caggaaggtttccgtgtgaagaactgtagagtatcacagctgtatttttgcatgcccccg
ggatctaaatcggacccccaggacacagcaccagctgctcaggaaccaggagacttgccc
agggaccgcctgcagagatggcggcgcagtgggagtgagtacgcggggcttagtgtcgtg
agagctgccctgggaaggggacttggagtgcgcagtcagctagtgacgcggtctcctaag
ggcccccaggacagcttgagtagcccccgcgggcaggcccccgacccgggttccgcgttg
acccctcagagcggcctgctgctcccccacgcgccagaactggcgacagggccactttaa

>gi568815585r:36070531_36314526|GENSCAN_predicted_peptide_6|132_aa
MEETAPIINDLHLVLPLTCGNITIQDDIWKAGSRDSQDPAPSDTTVYVGSRTKVMRAAMI
VRAESRMAELEGSVLTPGDELSQLVPVSPSEPPSCRGFPQEHYELSPRKRFLVSHCYRME
SLDCQLSKFLVY

>gi568815585r:36070531_36314526|GENSCAN_predicted_CDS_6|399_bp
atggaggaaacagcccccataatcaatgacctccatctggtcttgcccttgacatgtggg
aatattacaattcaagatgacatttggaaggcaggctccagggactcccaggatcctgct
ccctcggataccacagtctatgtaggcagcaggactaaagtgatgagagccgccatgata
gtgagggccgagagcaggatggcggaattagaaggcagcgtcctgacccctggcgatgag
ctgagtcaattggtccctgtctccccctcagagccaccatcttgtcgtggcttccctcag
gaacattatgagctctcccccaggaagcgcttcttggtttcccattgttaccggatggaa
agtcttgactgccagttgtccaagttcttggtgtattga

>gi568815585r:36070531_36314526|GENSCAN_predicted_peptide_7|116_aa
MIISTDAEKALDKIQHHFVIKTLNKLDIEGTYFKIIKTTHDRPTANIILNGEKLKAFSLR
TGTRQGCRHAPLLLNIVLEVLARAFGQERNKGHPNWKRGIRTIAVFRRHDRIPRKS

>gi568815585r:36070531_36314526|GENSCAN_predicted_CDS_7|351_bp
atgatcatctcaacagatgcagaaaaagcactggacaaaatccagcatcactttgtgata
aaaactctcaacaaactagacatagaagggacttacttcaaaataataaaaaccacacat
gacagacccacagccaacatcatactgaacggggagaagttgaaagcattctccctgaga
acaggaacaagacaaggctgccgacatgcaccactcctattgaatatagttctggaagtc
ctagccagagcattcgggcaagagagaaataaagggcatccaaattggaaaagaggaatc
cgaactattgctgttttcagacgacatgatcgtatacctagaaaatcctaa

>gi568815585r:36070531_36314526|GENSCAN_predicted_peptide_8|229_aa
MKEERNYNFDGVSTNRLKQQLLEEVRKNGTPIAKELRVKDLNPIKRFRPDGSGGEQTFKI
KHFSIRPQQWNQRKKVTEANLEAEGESCHLRPSQGEKKLKAGEPQHPRTEGCGMNPSHFL
KSPVKKPDWQAQALEHTALTIRDTKIVTSNCSEWKTRYETQLELNDELEKQIVYLKEKVE
KIHGNSSGMEEGGGEKGAENWETVEDEHMEQLVTGTPLPSHAAGILSFS

>gi568815585r:36070531_36314526|GENSCAN_predicted_CDS_8|690_bp
atgaaggaagagagaaactacaacttcgacggtgtgagcaccaaccgcctgaaacagcag
ttgctggaagaagtccgcaagaatgggacccctattgccaaggaacttagagtcaaagac
ttaaatccaattaaacgttttaggccagatgggagtggaggtgagcagacctttaagata
aaacattttagcatcaggccacaacagtggaaccagaggaaaaaagttacagaagctaat
ttagaagctgaaggagagagctgtcatctcagaccttctcagggggagaaaaagctgaaa
gcaggagagcctcagcaccctagaactgaagggtgtgggatgaatccctcccatttcctc
aagtcaccagtcaagaaaccagactggcaggctcaggccctggagcatacagcactcacc
atacgggacactaaaattgtaaccagcaattgtagtgaatggaaaacccgttatgagaca
caacttgaattaaatgatgaactagaaaagcaaattgtttatctcaaggagaaagtggaa
aaaatccatggaaactcttcaggtatggaagaaggtgggggtgagaaaggagctgaaaac
tgggagacagttgaagatgaacacatggaacagttggtcactgggacccctctgccatcc
cacgcagctgggatactctccttcagttag

>gi568815585r:36070531_36314526|GENSCAN_predicted_peptide_9|237_aa
XASWVSWGLVKGAEITGKAIQKGASKLRERIQPEEKPVEVSPAVTKGLYIAKQATGGAAK
VSQFLVDGVCTVANCVGKELAPHVKKHGSKLVPESLKKDKDGKSPLDGAMVVAASSVQGF
STVWQGLECAAKCIVNNVSAETVQTVRYKYGYNAGEATHHAVDSAVNVGVTAYNINNIGI
KAMVKKTATQTGHTLLEDYQIVDNSQRENQEGAANVNVRGEKDEQTKEVKEAKKKDK

>gi568815585r:36070531_36314526|GENSCAN_predicted_CDS_9|714_bp
ngtgcttcctgggtgagttggggtttagtcaaaggtgctgagattactggtaaggcaatc
cagaaaggtgcttctaaactccgagagcggattcaaccagaagaaaaacccgtggaagtt
agtccagctgtcaccaagggactttatatagcgaagcaagctacaggaggagcagcaaaa
gtcagtcagttcctggttgatggagtttgcactgtagcaaattgcgttggaaaagaacta
gctccacatgtcaagaagcatggaagcaaacttgttccagaatctcttaaaaaagacaaa
gatgggaaatctcctctggatggtgctatggttgtagcagcaagtagtgttcaaggattt
tcaactgtctggcaaggattggaatgtgcagctaaatgcatcgttaacaatgtttcagca
gaaactgtacaaactgtcagatacaaatacggatataatgcaggagaagctacccaccat
gcggtggattctgcggtcaatgttggcgtaactgcctacaatattaacaacattggtatc
aaagcaatggtgaagaaaactgcaacacaaacaggacacactctccttgaggactatcag
atagttgataattctcagagggaaaatcaagaaggagcagcaaatgtcaacgtgagaggg
gagaaggatgagcagacgaaggaagtaaaggaggcaaagaagaaagataaatga