GENSCAN 1.0 Date run: 3-Nov-116 Time: 04:22:52 Sequence gi568815578f:11818276_12023663 : 205388 bp : 40.03% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 10331 10454 124 1 1 98 95 69 0.549 7.84 1.02 Intr + 23936 24022 87 0 0 100 38 55 0.555 0.72 1.03 Term + 24111 24166 56 1 2 121 47 86 0.906 4.54 1.04 PlyA + 24417 24422 6 1.05 2.03 PlyA - 24810 24805 6 1.05 2.02 Term - 28460 28373 88 1 1 97 47 69 0.411 -0.15 2.01 Init - 29512 29253 260 1 2 84 92 84 0.837 5.06 2.00 Prom - 31662 31623 40 -4.95 3.06 PlyA - 31803 31798 6 1.05 3.05 Term - 43079 43053 27 2 0 83 36 63 0.076 -2.30 3.04 Intr - 52452 52370 83 1 2 108 82 113 0.603 11.14 3.03 Intr - 53220 53132 89 2 2 28 68 18 0.218 -7.50 3.02 Intr - 53883 53711 173 0 2 27 51 196 0.489 7.62 3.01 Init - 56080 55976 105 1 0 83 92 72 0.451 7.37 3.00 Prom - 59235 59196 40 -3.55 4.00 Prom + 63423 63462 40 -4.05 4.01 Init + 72493 72679 187 0 1 58 101 125 0.131 8.07 4.02 Intr + 100059 100326 268 1 1 80 107 181 0.175 14.87 4.03 Intr + 100811 101192 382 2 1 80 66 115 0.400 2.49 4.04 Intr + 101443 101561 119 0 2 47 74 192 0.889 12.04 4.05 Term + 104359 105391 1033 1 1 83 48 1191 0.997 105.20 4.06 PlyA + 105756 105761 6 1.05 5.04 PlyA - 105907 105902 6 1.05 5.03 Term - 107133 106997 137 1 2 99 42 134 0.909 7.10 5.02 Intr - 111888 111779 110 2 2 87 57 49 0.045 0.71 5.01 Init - 120410 120289 122 2 2 21 77 119 0.359 3.81 5.00 Prom - 120920 120881 40 -3.45 6.00 Prom + 121515 121554 40 -6.75 6.01 Init + 128214 128262 49 2 1 86 58 59 0.476 1.88 6.02 Intr + 128371 128424 54 2 0 46 98 81 0.484 2.93 6.03 Intr + 129180 129338 159 1 0 67 54 75 0.207 1.04 6.04 Intr + 129576 129618 43 1 1 52 99 53 0.270 -0.82 6.05 Intr + 134759 134903 145 2 1 27 78 112 0.343 3.46 6.06 Intr + 140656 140792 137 0 2 13 78 113 0.185 1.25 6.07 Intr + 140922 141009 88 0 1 2 85 191 0.352 9.25 6.08 Intr + 145183 145230 48 0 0 86 110 11 0.210 1.06 6.09 Term + 151528 151716 189 0 0 74 50 112 0.281 2.47 6.10 PlyA + 152045 152050 6 -0.45 7.00 Prom + 153499 153538 40 -5.25 7.01 Init + 154826 154875 50 1 2 74 77 58 0.429 3.77 7.02 Intr + 156766 156851 86 1 2 114 27 83 0.194 3.34 7.03 Intr + 157429 157502 74 2 2 69 81 74 0.102 3.01 7.04 Intr + 174747 174866 120 2 0 69 91 57 0.192 3.87 7.05 Intr + 180050 180211 162 1 0 50 33 117 0.248 1.65 7.06 Intr + 183215 183496 282 1 0 58 101 102 0.629 5.19 7.07 Intr + 183938 184235 298 1 1 53 58 128 0.634 1.82 7.08 Term + 194269 194420 152 2 2 65 49 117 0.065 2.59 7.09 PlyA + 196153 196158 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:11818276_12023663|GENSCAN_predicted_peptide_1|88_aa VRVAPVGADSPHASMLGPNSPGHTSKDENKWHEVDKNSCIIVATGRFYFQLPFISHVTDS GQWNVARSDEEANGPEMAKPYHEKRLDL >gi568815578f:11818276_12023663|GENSCAN_predicted_CDS_1|267_bp gtaagggttgccccagtgggtgcagacagcccccatgcttccatgttaggcccaaacagt cctgggcatacatcaaaagatgaaaacaagtggcatgaggttgataaaaacagctgtatc atagttgccacaggaagattttatttccagctgcccttcatcagtcacgtgactgattct ggccaatggaatgtggccagaagtgatgaagaagcaaatggccccgagatggcaaagcca taccatgaaaagcgactggacctctga >gi568815578f:11818276_12023663|GENSCAN_predicted_peptide_2|115_aa MTSHAGYILKNNRWWNLELYSTQPGQHYVGFIGLHHYLHLTDKETKNQKDLKICPKIHWL SVSEPGLNLDLSDCGGCALSTHSVEDRLAQTFSYGGSKVSRVRKQICVHLLEAKA >gi568815578f:11818276_12023663|GENSCAN_predicted_CDS_2|348_bp atgaccagccatgcaggttatatactgaagaacaaccgttggtggaatttggaattatac agtacacaaccaggacagcattatgtggggttcataggtttgcatcattatcttcatctt acagataaggaaaccaaaaatcagaaggatttaaaaatttgcccaaagatacattggctg tcagtgtcagaaccaggattaaacctggatttgtctgattgtggaggctgtgctctatcc actcactctgttgaagacaggttagcgcagactttcagttatggtggcagcaaagtttca agggtgagaaagcagatatgtgtacatcttcttgaagctaaggcttga >gi568815578f:11818276_12023663|GENSCAN_predicted_peptide_3|158_aa MACGNASGRGGMFGRCSVKVEFMRIIGVMGNEVREEKKEKLKKEEKEKDEKKNKKRKEKE EENEKEKKRRNSSSCRRRRKRRRRRRECAEAGGTMETSVETCVTPKQHINSGYVSIRSSL WSGIPMLEAFLAYNNIQVDYYECLDLYGKQLTTDYRLN >gi568815578f:11818276_12023663|GENSCAN_predicted_CDS_3|477_bp atggcctgtggaaatgcctctgggagaggtggcatgtttggacgttgttcagtaaaggtg gaatttatgaggatcataggtgtgatgggaaatgaagttagagaggagaagaaggagaag ttgaagaaggaggagaaggagaaggatgagaagaaaaacaagaagaggaaggagaaggag gaggaaaatgagaaggagaagaagagaaggaacagcagcagttgcaggaggaggagaaag aggaggaggaggaggagggagtgtgcagaagctggggggactatggaaacgtctgtggaa acctgtgtaacccctaaacagcacattaactcagggtatgtcagcattaggtcatctctt tggtcagggattccaatgttggaagcatttttagcgtacaacaacatccaagtggattac tatgaatgcctagacctctatggcaaacagttgacaacagattatcggctgaattag >gi568815578f:11818276_12023663|GENSCAN_predicted_peptide_4|662_aa MPLRSLPVTGHARSLPQSVPCVRVPAEPAGRWISECPGRGLRSGHRARRRTAASPEARRS AGETVKNRSKKSSKKANTSSSSSNSSKLPPVCYEIITLKTKKKKMAADIFPRKKPANSSS TSVQQYHQQNLSNNNLIPAPNWQGLYPTIRERNAMMFNNDLMADVHFVVGPPGGTQRLPG HKVSNSCMTGLVLTFTKRDPFHKPVTWCGQLADVRQCMFHSIRERAHPLQRALATLNFFF VSFYTALYLTHPLLTLQTWEVVVTGQESRMFTLLLEISYYVLAVGSSVFHAMFYGELAED KDEIRIPDVEPAAFLAMLKYIYCDEIDLAADTVLATLYAAKKYIVPHLARACVNFLETSL SAKNACVLLSQSCLFEEPDLTQRCWEVIDAQAELALKSEGFCDIDFQTLESILRRETLNA KEIVVFEAALNWAEVECQRQDLALSIENKRKVLGKALYLIRIPTMALDDFANGAAQSGVL TLNETNDIFLWYTAAKKPELQFVSKARKGLVPQRCHRFQSCAYRSNQWRYRGRCDSIQFA VDKRVFIAGFGLYGSSCGSAEYSAKIELKRQGVVLGQNLSKYFSDGSSNTFPVWFEYPVQ IEPDTFYTASVILDGNELSYFGQEGMTEVQCGKVTVQFQCSSDSTNGTGVQGGQIPELIF YA >gi568815578f:11818276_12023663|GENSCAN_predicted_CDS_4|1989_bp atgccgctccgctccctcccggtgaccgggcacgcgcgctcgctcccgcagagcgtgccc tgcgtgcgggtgcccgccgagcccgccgggcgctggatctccgagtgccccggccggggc ctgaggagcgggcacagggcaaggcggcggacggctgcgtcgcccgaggcgaggaggagc gctggcgagacggtaaagaacaggtccaagaaaagctcaaagaaagcaaataccagcagc agcagtagcaacagcagcaagttgccaccagtttgttatgaaataattaccttgaagact aaaaagaagaagatggctgctgatatattcccccgtaaaaagccagccaactccagcagc accagcgtccagcagtaccaccagcagaatctcagtaacaacaaccttatcccggcccca aactggcagggtctttatcccaccattagagagagaaatgcgatgatgttcaataatgat ttgatggcagatgtacattttgtggttgggccaccaggtgggactcaacggttgccagga cacaaagtaagcaacagctgcatgaccggtttagtcctgacgtttacaaagagggaccct ttccataagcctgtaacttggtgtgggcagcttgccgatgtcaggcagtgcatgtttcac tcgattagggagagagcgcaccctctccagagggctttggccacgcttaattttttcttt gtttccttctatactgctttatatctcacacatcccctcttaactctccagacatgggaa gttgttgtgacaggtcaggaaagtcgtatgtttacccttctcctagaaattagttattat gttttagctgttgggagctctgtgttccatgcgatgttttacggagaacttgcagaggac aaagatgaaatccgtataccagatgtcgaacctgctgcttttctcgctatgctgaaatat atctattgtgatgaaattgacttggctgctgacacagtgctggccacactttatgctgcc aaaaagtacattgtccctcaccttgccagagcctgtgttaatttcctggagaccagcctg agtgccaagaatgcctgtgtgctcctctcccagagctgcctgttcgaggagccagacctg acccagcgttgctgggaggtgattgatgcccaggctgagttagctctcaagtctgaggga ttctgcgatattgacttccagacactagaaagtattctccgtagggaaactctgaatgcc aaagaaattgtggtttttgaggcagctctcaactgggctgaagtagaatgccaacgacaa gatctggcgttgagcattgaaaataaacgcaaggttctaggaaaggcactttacttgatc cgcatacccacaatggccctcgatgattttgcaaatggtgctgcacagtccggggtatta actctcaatgagaccaacgacatcttcctctggtatactgcagccaaaaagcctgagctt cagtttgtgagtaaagcccgtaagggccttgtcccccagcgctgtcaccgtttccagtcg tgtgcctatcgaagcaaccaatggcgctatcgtggtcgctgtgacagcatccagtttgca gttgataaaagagtgttcattgctggctttgggctgtatggctccagctgtggttctgca gaatacagtgccaagattgaacttaagcggcagggcgttgtcctggggcagaacttgagc aagtacttctcagatgggtccagcaatacctttcccgtatggtttgaatacccagtgcag atcgagccagacaccttctacacagccagtgtgatactggatggcaatgaactcagctac tttggacaagaaggcatgacagaagttcagtgtggcaaagtgactgtccagtttcagtgc tcctcagatagcaccaatggcactggggtacagggagggcagatccctgaacttatattc tatgcttga >gi568815578f:11818276_12023663|GENSCAN_predicted_peptide_5|122_aa MSDGKRSQRYQIEKSEGEFIIPLPLCSIQALNRLDDAYPYWYYCQCFRNNVFHSHNDHFE EVTIVTAIILEGTEAQRGKPAEGNKPTSTGEEVAQRHSEYVRAVTHGPCWGQQEDHYCPT SY >gi568815578f:11818276_12023663|GENSCAN_predicted_CDS_5|369_bp atgtctgatggcaagagaagtcagaggtaccagatcgagaagagtgaaggagaattcatc attcctctgcctttgtgttctattcaggccctcaatagattggatgatgcctatccatat tggtactattgtcagtgctttagaaataatgtatttcatagtcataatgaccattttgag gaagtcactattgttactgccatcattcttgagggaactgaggctcagagagggaagccg gctgaaggcaacaagcccacatctacaggggaagaagtggcccagaggcacagtgaatac gtcagggcagtcactcatgggccttgctggggccagcaggaggaccattactgtcccacc tcttactag >gi568815578f:11818276_12023663|GENSCAN_predicted_peptide_6|303_aa MGFRHVGQAALKLLTSEYNWSINSSNKTKWYNAEEAVCLNIHQALSTVDESAPLISVTVN GVALRYSFLTLSTLEEIYIVLLDNNNHGRGEPGLDEAVWALSLGLGVFVDQPQLLLNPRP VKEGTLIASTHGLWEIPGHLVIPGCKDSWELQKPDSIFTLISRSINQASRMCELRRQRSS SGGAGLRLPRFPFRRPNEYKEEDSEERDHGEDSDTYKDDDEDDELARPDGLEQVCLVVCN QKSLLSESGPCFSSPYQLADLYSCYILLCHSPESSAKTPSGQLQAMGKSLGCVCSGITVS MDT >gi568815578f:11818276_12023663|GENSCAN_predicted_CDS_6|912_bp atggggtttcgccatgttgggcaggctgctctcaaactcctgacctcagagtacaattgg agcatcaacagcagcaataaaaccaaatggtacaacgctgaagaggctgtttgcttaaat atacatcaagccctttctactgtggatgaatctgctccattaatatctgtcactgttaat ggagttgctttgcgctactcctttctcactctgtccactcttgaggaaatttacattgtc ctcttagataataataaccatggcagaggagagccaggactggatgaagcggtgtgggca ctgagcttgggccttggagtctttgtagatcagcctcagttgcttcttaacccaagacct gtaaaagaagggacactcatcgcttccactcatggtctgtgggagatacctggtcatttg gtcatacctggatgcaaggacagctgggagctccagaagcctgacagtattttcacgttg atctcccgttcgataaatcaagcctcgcggatgtgtgaactgcgacgccaacgcagtagc tctggaggagcggggctgcggctgccgcgcttcccgtttcgcaggccaaatgagtacaag gaagaagattcagaagaaagagaccatggagaagatagtgatacttataaagatgatgat gaggatgatgagctggccagacctgatggtcttgagcaggtttgcttagtagtctgcaac cagaagtcattgctgagtgaaagtggaccttgtttttcttcaccctaccaattggctgac ttgtattcatgctacatattgctctgccattcaccagaaagttcggcgaaaacacctagt ggtcaactacaggccatggggaaaagtctgggctgtgtctgctctgggatcacagtgagc atggacacttag >gi568815578f:11818276_12023663|GENSCAN_predicted_peptide_7|407_aa MQHQIHTGGHGQGEDTRWTYARLLIERQQHNEAGASQNPYLNKGNDFSTGLLDCSHDKRL LGFLQSKKSMVQTFYLFNLNLKNLIFSARAQRFCPGHGGPRRAGLGCQHQDRPRIALYYI QMLTKLMTKARRDKPTKCLLTSLRGDHFWDADWGQLTTSRMLIKSKVNDAAGYYKAFYLD LMSMPVKASSQRLTSQGCYNFDYFWMEALLSLFRCLSIMPRVSTAPEPSFHKTKEIACCG FHRSLWSLFRFGHFEVNASGIVKGKNEKDFRTWRWGKEELSKNSKGLERQTPATTDTRLD SCISSKPAVLYYGQPSTSIDTTWKLHRTAESQAAQDLQNQNLWGQLGSRNLCLNEPPAEA CFLRYELTSISALFGGSREKAHRQGVGSCLQQVTVTGAPIPFGIMTF >gi568815578f:11818276_12023663|GENSCAN_predicted_CDS_7|1224_bp atgcaacaccaaattcatactggtggccatggtcaaggggaagacacaagatggacctat gcaagacttctgattgaaagacagcagcataatgaagctggtgcttcacagaatccatat ttgaataaagggaatgacttctctacaggtctgctcgactgttctcatgacaagcggctg ctgggctttctgcagagcaagaagtccatggtacagacattttacctctttaatcttaat ttaaaaaacctcatcttctctgcccgagcccaacgcttctgcccgggacatggaggacct agacgtgcaggactgggctgtcagcaccaggatagaccaagaattgcactttactacatc caaatgctgacgaaactgatgacaaaagctagaagagacaagcccaccaagtgccttctg acctctctcaggggggaccacttctgggatgctgattggggtcagctgaccacttccagg atgctgattaagtcaaaggtaaatgatgcagctggatattacaaagctttctaccttgac ttgatgtctatgcctgttaaagcaagcagccagcgtctgacatctcagggttgttacaac tttgattatttttggatggaggcactgctgtccttatttcggtgcctctcaattatgcct cgtgtctcaactgctcctgaaccttcattccataaaacaaaagagattgcctgctgcggt tttcataggtctttgtggagtctctttagatttgggcattttgaggtcaatgcgtcagga attgttaagggcaagaatgagaaagatttcaggacttggaggtgggggaaagaggagcta tcaaagaatagcaagggccttgaaagacaaaccccagctactactgacaccaggctggac agctgcatatcctccaagccagcagtcctatactatggtcagcctagcaccagcattgac actacctggaagcttcatagaactgcagaatctcaggctgctcaagatttgcagaatcag aatctctgggggcagctggggtccagaaatctgtgtttgaatgagcccccagccgaagcc tgcttcttgaggtatgaactcaccagcatttcagccctgtttggtgggagcagagagaaa gcccacagacagggagttggcagctgcctgcagcaggtcaccgttactggggcaccaata ccatttggtataatgaccttctag