GENSCAN 1.0	Date run: 5-Nov-116	Time: 10:40:38

Sequence gi568815581f:67937133_68146593 : 209461 bp : 44.55% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   3492   3524   33  2  0  101  113   -18 0.608   1.90
 1.02 Intr +   7018   7240  223  0  1  106   80   127 0.720  11.40
 1.03 Intr +   8277   8320   44  1  2   81   88   -12 0.575  -3.94
 1.04 Intr +   8843   9193  351  1  0   91  103   177 0.850  15.02
 1.05 Intr +  10594  10676   83  0  2   52   97    36 0.802  -0.66
 1.06 Intr +  10949  11174  226  2  1   59  110   257 0.814  22.99
 1.07 Intr +  22610  22743  134  1  2   89   78    83 0.868   6.84
 1.08 Intr +  26201  26374  174  2  0   56   79   127 0.983   7.65
 1.09 Intr +  27080  27272  193  2  1   71   94   161 0.995  14.49
 1.10 Intr +  29440  29524   85  0  1  106  106    31 0.928   6.09
 1.11 Term +  38640  38830  191  1  2   60   53   178 0.371   9.11
 1.12 PlyA +  38949  38954    6                               1.05

 2.05 PlyA -  41039  41034    6                               1.05
 2.04 Term -  43539  43378  162  0  0   90   43    71 0.257   0.74
 2.03 Intr -  56103  55835  269  1  2   31   37   494 0.176  35.85
 2.02 Intr -  56852  56292  561  0  0  119   97   387 0.999  35.49
 2.01 Init -  59066  58991   76  2  1   77   69    45 0.776   2.20
 2.00 Prom -  69429  69390   40                              -5.16

 3.00 Prom +  69466  69505   40                              -5.66
 3.01 Init +  78573  78575    3  2  0   60  101     0 0.455  -1.50
 3.02 Intr +  82845  83085  241  2  1   56   50   243 0.756  14.42
 3.03 Intr +  98646  98708   63  1  0    3   92   108 0.021   1.39
 3.04 Intr +  99978 100075   98  1  2   71   79    56 0.031   2.73
 3.05 Intr + 100226 100363  138  1  0   36   98   173 0.976  13.76
 3.06 Intr + 103546 103634   89  0  2  113  111    83 0.999  11.87
 3.07 Intr + 104953 105221  269  1  2  121   80    53 0.905   4.98
 3.08 Intr + 105773 105867   95  0  2   86   92    22 0.946   2.08
 3.09 Intr + 105968 106231  264  1  0  109  103   183 0.998  19.61
 3.10 Intr + 106706 106939  234  1  0   97   92   133 0.963  12.49
 3.11 Intr + 107189 107371  183  1  0   70  111   124 0.999  12.88
 3.12 Intr + 108640 108789  150  0  0  101  105    80 0.974  11.36
 3.13 Term + 158428 158628  201  0  0   32   48   227 0.493  10.49
 3.14 PlyA + 158964 158969    6                               1.05

 4.00 Prom + 160642 160681   40                              -4.96
 4.01 Init + 161488 161492    5  0  2   76   55     0 0.280  -5.03
 4.02 Intr + 162456 162576  121  0  1   63   56    99 0.423   4.70
 4.03 Intr + 163985 164106  122  1  2   67   36    81 0.734   0.09
 4.04 Intr + 164580 164827  248  0  2   97   72   154 0.607  11.80
 4.05 Term + 185681 185799  119  0  2   61   54    91 0.009   1.70
 4.06 PlyA + 186403 186408    6                               1.05

 5.09 PlyA - 186470 186465    6                               1.05
 5.08 Term - 189188 189153   36  2  0   82   44    40 0.492  -3.66
 5.07 Intr - 189643 189345  299  2  2  100   78   305 0.846  27.29
 5.06 Intr - 191149 191009  141  2  0   86   38   162 0.928  11.32
 5.05 Intr - 196601 196473  129  0  0   51   55   102 0.491   3.87
 5.04 Intr - 197723 197539  185  1  2   -5   -3   158 0.002  -3.17
 5.03 Intr - 197962 197850  113  1  2  -19   50   234 0.159   8.08
 5.02 Intr - 198317 198074  244  1  1  -16   38   518 0.128  33.90
 5.01 Init - 198472 198333  140  1  2   87  105   207 0.968  21.91

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 100001 100075   75  1  0   73   79    95 0.943   8.19
S.002 Term + 109372 109464   93  0  0   59   36    65 0.857  -3.77
S.003 Term - 122881 122751  131  2  2   89   45    74 0.831   1.54
S.004 Term - 143713 143613  101  2  2   70   39    91 0.840   0.69
S.005 Init - 185079 185021   59  0  2   92   74    99 0.908   9.68

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581f:67937133_68146593|GENSCAN_predicted_peptide_1|578_aa
MAQLTQLTQGHGGNQGLTVVIQGQGQTTGQLQLIPQGVTVLPGPGQQLMQAAMPNGTVQR
FLFTPLATTATTASTTTTTVSTTAAGTGEQRQSKLSPQMQLQIQQPQPQVIAVPQLQQQV
QVLSQIQSQVVAQIQAQQSGVPQQIKLQLPIQIQQSSAVQTHQIQNVVTVQAASVQEQLQ
RVQQLRDQQQKKKQQQIEIKREHTLQASNQSEIIQKQVVMKHNAVIEHLKQKKSMTPAER
EENQRMIVCNQVMKYILDKIDKEEKQAAKKRKREESVEQKRSKQNATKLSALLFKHKEQL
RAEILKKRALLDKDLQIEVQKRKREEEKDSSSKSKKKKMISTTSKETKKDTKLYCICKTP
YDESKFYIGCDLCTNWYHGECVGITEKEAKKMDVYICNDCKRAQEGSSEELYCICRTPYD
ESQFYIGCDRCQNWYHGRCVGILQSEAELIDEYVCPQCQSTEDAMTVLTPLTEKDYEGLK
RVLRSLQAHKMAWPFLEPVDPNDAPDYYGVIKEPMDLATMEERVQRRYYEKLTEFVADMT
KIFDNCRYYNPSDSPFYQCAEVLESFFVQKLKGFKASR
>gi568815581f:67937133_68146593|GENSCAN_predicted_CDS_1|1737_bp
atggctcaacttactcagttaacacagggccacggtggcaatcaaggtttgacagtagta
attcaaggacaaggtcaaactactggacagttgcagttgatacctcaaggggtgactgta
ctcccaggcccaggccagcagctaatgcaagctgcaatgccaaatggtactgttcagcga
ttcctctttaccccattggcaacaacagccaccacagccagcaccaccaccaccactgtt
tccacgacagcagcaggtacaggtgaacaaaggcagagtaaactgtcaccccagatgcag
ctacaaatacagcagccacagccccaagtcattgctgtgcctcagctgcaacaacaagtc
caggttctctctcagatccagtcacaggttgtggctcagatacaggctcagcaaagtggt
gtgccccagcaaatcaaactccagttacctatccaaattcagcaaagcagtgctgtgcag
actcaccagattcagaatgtggttacagtgcaggcagccagtgtgcaagagcagttgcaa
agggttcagcaactcagggatcagcagcaaaagaagaaacagcaacagatagaaattaag
cgtgaacacaccctccaagcttctaatcaaagtgaaatcattcagaaacaggtggtgatg
aagcataatgctgtaatagaacatttaaaacagaaaaagagcatgactccagctgaaaga
gaagagaatcaaagaatgattgtctgtaaccaggtgatgaagtatattttggataagata
gataaagaagaaaaacaggcagcaaaaaaacggaagcgtgaagagagtgtggagcagaaa
cgtagcaagcagaatgccactaagctgtcagctctgctcttcaagcacaaagagcagctc
agagccgagatcctgaagaagagagcactcctggacaaggatctgcaaattgaagtgcag
aagaggaagcgggaagaggaaaaagactccagctcaaagtccaagaaaaagaaaatgatc
tctactacctcaaaggaaactaagaaggacacaaagctttactgtatctgtaaaacgcct
tatgatgaatctaagttctatattggctgtgatctttgtactaactggtatcatggagaa
tgtgttggcatcacagaaaaggaggctaagaaaatggatgtgtacatctgtaatgattgt
aaacgggcacaagagggcagcagtgaggaattgtactgtatctgcagaacaccttatgat
gagtcacaattttatattggctgtgatcggtgtcagaattggtaccatgggcgctgcgtt
ggcatcttgcaaagtgaggcagagctcattgatgagtatgtctgtccacagtgccagtca
acagaggatgccatgacagtgctcacgccactaacagagaaggattatgaggggttgaag
agggtgctccgttccttacaggcccataagatggcctggcctttccttgaaccagtagac
cctaatgatgcaccagattattatggtgttattaaggaacctatggaccttgccaccatg
gaagaaagagtacaaagacgatattatgaaaagctgacggaatttgtggcagatatgacc
aaaatttttgataactgtcgttactacaatccaagtgactccccattttaccagtgtgca
gaagttctcgaatcattctttgtacagaaattgaaaggcttcaaagctagcaggtga
>gi568815581f:67937133_68146593|GENSCAN_predicted_peptide_2|355_aa
MTARAFWLLCLIVGSSPEAPVAERKTSPPHSRKPDSRGCPSAEETPGPRAQPLLEAPQRP
RAAEVAPAARAWPDPRRRKPPPPADNQASFREAARAPAGPPGPRLAQAENRASPRREPAS
EDAPRRARSRALRFPAARPPALATEGSAGHAHPNRPRAAALAPGPAPAPPPRFSLSFGLS
PPQRDAEPGAEPCARACRSDLDEREAFCESEFAVNGIVHDVDVLGAGIRLVTLLVDRDGL
YKMNRLYLTPDGFFFRVHMLALDSSSCNKPCPEFKPGIETDLNDAAYVLYTTVCNVGATA
RASPGPPGNFLSFISSCQQSSCGLCQQAGSRVGFTLLGFYLTVTTKGLPNLKKQF
>gi568815581f:67937133_68146593|GENSCAN_predicted_CDS_2|1068_bp
atgacagctagagctttctggctcctctgtttgatcgtcggatcatcccccgaagcaccg
gtggcggagagaaaaacctcgccgccgcacagcagaaagccggactcccgcggctgcccg
agcgcggaggagacaccggggccccgcgcgcagccactccttgaggccccgcagcggccg
cgcgcggccgaggtcgctcccgccgcccgcgcctggcctgacccgcgccgccggaagccc
ccaccgccggccgacaaccaggccagcttccgggaggccgcacgcgcgcccgctggcccg
ccgggcccgcgcctggcgcaagccgagaaccgcgcgtcgccgcgccgcgagcccgcgtcg
gaggacgccccgcgacgcgcgcgctcacgggccctgcgcttccccgccgcccggccgccc
gcgctcgccaccgagggctccgccggccacgcccaccccaaccggccgcgcgccgccgcg
ctggccccgggacccgcgcccgcgccgccgccgcgcttcagcctcagcttcggcctcagc
ccgccgcagagggacgcggagcccggcgctgagccctgcgcgcgcgcctgccggtccgat
ctggacgaacgcgaggcgttctgcgagagcgaattcgcggtgaacgggatcgtgcacgat
gtggacgtgctgggcgcgggcatccggctggtgaccctgctggtggatcgggacgggctg
tacaagatgaaccgcctgtacctcacccccgacggcttcttcttccgagtccacatgtta
gccctggactcctccagctgcaataagccgtgtccagagtttaaacctggtattgaaact
gacctgaatgacgctgcatatgtactttataccaccgtttgtaacgtgggtgccacagcc
cgggcttccccaggacctccaggcaatttcctctccttcatcagcagttgccagcagtca
tcctgtgggctctgccagcaagctggcagcagggtaggattcacactccttggcttctac
ctcacagttaccaccaaaggccttcccaatttaaaaaagcaattttag
>gi568815581f:67937133_68146593|GENSCAN_predicted_peptide_3|675_aa
MDPASHTLSLRDAIITLPPPRIAVSRREASDLAAAPSGGGARLRFSASLSAAWTSRALRH
GPGAAARAPEGLRVTLRALLPVALQPLIPPASPPVEAQARFAAFSLCLITMSTNENANTP
AARLHRFKNKGKDSTEMRRRRIEVNVELRKAKKDDQMLKRRNVSSFPDDATSPLQENRNN
QGTVNWSVDDIVKGINSSNVENQLQATQAARKLLSREKQPPIDNIIRAGLIPKFVSFLGR
TDCSPIQFESAWALTNIASGTSEQTKAVVDGGAIPAFISLLASPHAHISEQAVWALGNIA
GDGSVFRDLVIKYGAVDPLLALLAVPDMSSLACGYLRNLTWTLSNLCRNKNPAPPIDAVE
QILPTLVRLLHHDDPEVLADTCWAISYLTDGPNERIGMVVKTGVVPQLVKLLGASELPIV
TPALRAIGNIVTGTDEQTQVVIDAGALAVFPSLLTNPKTNIQKEATWTMSNITAGRQDQI
QQVVNHGLVPFLVSVLSKADFKTQKEAVWAVTNYTSGGTVEQIVYLVHCGIIEPLMNLLT
AKDTKIILVILDAISNIFQAAEKLGETEKLSIMIEECGGLDKIEALQNHENESVYKASLS
LIEKYFSVEELEKLCMSDKLWEQIVQSICDTITPDVRSLAEDTGWRQLFFTNKLQLQRQL
RKRKQKYGSLREKQP
>gi568815581f:67937133_68146593|GENSCAN_predicted_CDS_3|2028_bp
atggatccggcttctcacaccctctccttacgcgatgccatcatcacgctcccgcccccg
cgcatcgcggtctcgcgccgagaggcaagtgatttggcagccgcccccagcggcggcggc
gcgcgcctgcgcttttctgcgtccctgtcggccgcctggacttcccgcgcgctgcgccat
gggcccggagcggccgcaagggccccggagggcctgcgcgtgaccctccgagctctcctg
cccgtcgccctacagccgctgattccccccgcatcgcctcccgtggaagcccaggcccgc
ttcgcagctttctccctttgtctcataaccatgtccaccaacgagaatgctaatacacca
gctgcccgtcttcacagattcaagaacaagggaaaagacagtacagaaatgaggcgtcgc
agaatagaggtcaatgtggagctgaggaaagctaagaaggatgaccagatgctgaagagg
agaaatgtaagctcatttcctgatgatgctacttctccgctgcaggaaaaccgcaacaac
cagggcactgtaaattggtctgttgatgacattgtcaaaggcataaatagcagcaatgtg
gaaaatcagctccaagctactcaagctgccaggaaactactttccagagaaaaacagccc
cccatagacaacataatccgggctggtttgattccgaaatttgtgtccttcttgggcaga
actgattgtagtcccattcagtttgaatctgcttgggcactcactaacattgcttctggg
acatcagaacaaaccaaggctgtggtagatggaggtgccatcccagcattcatttctctg
ttggcatctccccatgctcacatcagtgaacaagctgtctgggctctaggaaacattgca
ggtgatggctcagtgttccgagacttggttattaagtacggtgcagttgacccactgttg
gctctccttgcagttcctgatatgtcatctttagcatgtggctacttacgtaatcttacc
tggacactttctaatctttgccgcaacaagaatcctgcacccccgatagatgctgttgag
cagattcttcctaccttagttcggctcctgcatcatgatgatccagaagtattagcagat
acctgctgggctatttcctaccttactgatggtccaaatgaacgaattggcatggtggtg
aaaacaggagttgtgccccaacttgtgaagcttctaggagcttctgaattgccaattgtg
actcctgccctaagagccatagggaatattgtcactggtacagatgaacagactcaggtt
gtgattgatgcaggagcactcgccgtctttcccagcctgctcaccaaccccaaaactaac
attcagaaggaagctacgtggacaatgtcaaacatcacagccggccgccaggaccagata
cagcaagttgtgaatcatggattagtcccattccttgtcagtgttctctctaaggcagat
tttaagacacaaaaggaagctgtgtgggccgtgaccaactataccagtggtggaacagtt
gaacagattgtgtaccttgttcactgtggcataatagaaccgttgatgaacctcttaact
gcaaaagataccaagattattctggttatcctggatgccatttcaaatatctttcaggct
gctgagaaactaggtgaaactgagaaacttagtataatgattgaagaatgtggaggctta
gacaaaattgaagctctacaaaaccatgaaaatgagtctgtgtataaggcttcgttaagc
ttaattgagaagtatttctctgtagaggaattggagaagctgtgcatgtctgataaactg
tgggaacagatagtccagtcgatctgcgacaccatcactcctgacgtgaggtccctggcg
gaggacacgggctggagacagctgttcttcaccaacaagctccagctccagcggcagctc
cgcaagaggaaacaaaaatatggaagcttgagagaaaagcaaccttag
>gi568815581f:67937133_68146593|GENSCAN_predicted_peptide_4|204_aa
MSPSLPDKPCPLLTAPRPIDHPRAEECEHMAQDWQAAPPAAPVSPGWRRQLQDQEQHSWG
FAGTIGVGMLSLPSPEIRQVPGGPRLEEGKAVASAPSAAHGDDAWHIRDPQAAAVASPAA
RAPAPAAVVTPPRLAVAPVPPAAPAARDMSNLGGRRNGPVEVRLTVAPGHLDEYLQITGV
NDGLAWAQRPDIEDPSLSFSDALL
>gi568815581f:67937133_68146593|GENSCAN_predicted_CDS_4|615_bp
atgagcccgagcctccccgacaagccctgccccctgctcacggcgcccaggcccatcgac
cacccaagggctgaggagtgcgagcacatggcacaggactggcaggcagctccacctgca
gccccggtgagtcccggatggaggcgtcaactacaagaccaggaacagcactcctggggc
tttgctggaaccatcggcgtgggaatgctgtctcttccctcgccagaaatacgacaagtt
cccggcgggcctcgcctagaggagggaaaggccgtggcgtccgctccctccgcggctcat
ggcgacgacgcttggcacatccgggaccctcaggccgctgcagtcgcgtcgccggctgct
cgggccccagccccggccgctgtggtgactccgccgcgcctcgccgtcgcccccgtcccg
cccgccgccccagccgccagggacatgtctaacctcggaggccgtaggaacgggcctgtc
gaagtgcgcctgacagttgctccaggccacctggacgaatacctgcagatcacaggggtt
aacgatggcttagcttgggctcagcggcctgacattgaggatccttctttatctttttct
gatgcccttctatga
>gi568815581f:67937133_68146593|GENSCAN_predicted_peptide_5|428_aa
MSVAGLKKQFYKASQLVSEKVGGAEGTKLDDDFEEMEKVDVISKAVTTIEYPQPNPASQA
KLTMLNTACKIRGQVKNPGYLQSGGLLGECLIRHGKELRDESNFSDALLDAGEPMKHLAE
VKDSLDIEKRQGKIPEEELHQALEKFEESKEVAETSMHNLLETDIQREYKPKPRECFDLG
KPEQSNRGFPCTTVPKIAASSSFRSYDKSIWTPSRSMPPLDQPSCKGSRPTEACVTLAQA
ATTSHLHNPQHTLHGTTAPAVLLRAKHKGSSEEASVGNPERVFMKVLQAQKKHMSIELTT
EPEAASDSSGINLSVFGSDQFEIPLTHQLQSVIPNNDVRSFISHVIWTLKTDCSETYVQV
TCAKLISRTGLLMRLLSEQQEVKASKAEWDTDQWKTKNYINESTEAQSEQKEQKSSERVK
QISAQDEL
>gi568815581f:67937133_68146593|GENSCAN_predicted_CDS_5|1287_bp
atgtcggttgccgggctgaagaagcagttctacaaggcgagccaactggtcagtgagaag
gtcggaggggctgaggggaccaagctggatgatgacttcgaagagatggagaaggtggat
gtcatcagcaaggcggtgacgaccatcgagtacccgcagcccaacccagcctcgcaggct
aaactgaccatgctcaacaccgcgtgcaagatccggggccaggtgaagaaccccggctac
ctgcagtcaggggggctcctgggcgagtgcctgatccgccatgggaaggagctgcgcgac
gagtccaacttcagcgatgcactgctggatgccggcgagcccatgaagcacctggcagag
gtgaaggactccctggacatagagaagcggcagggcaagatccccgaagaggagctgcac
caggcgctggagaagtttgaggagtccaaggaggtggcagaaaccagcatgcacaacctc
ctggagaccgacatccagcgggagtataagcccaagccccgggagtgctttgacctcgga
aagcctgagcagtccaaccggggcttcccctgcaccacagtccccaagatcgcagcttcg
tcctctttccgatcttacgacaagtccatctggactcctagcaggagcatgccgccccta
gaccagccgagctgcaagggctccaggcccacagaagcctgtgtcactctggcacaagct
gccaccaccagccacctacacaaccctcagcacaccttgcacgggaccacagccccagct
gtgctgctgagggccaagcacaaaggctccagtgaagaagcatctgtagggaatccagaa
agagtgttcatgaaggtgttacaagcccagaagaagcacatgagcattgagctgactact
gagccggaggcagcctcagacagcagtggcatcaacttgtcagtctttgggagtgatcag
tttgaaattccgctaacccatcaactacagtccgtcatccccaacaacgatgtgagaagc
ttcatttctcatgttatctggaccttgaagacggactgctccgagacctatgtgcaagtg
acctgtgccaagctcatctccaggacaggcctcctgatgaggcttctcagtgagcagcag
gaagtaaaggcgtccaaggcagaatgggatacagaccagtggaaaactaagaactatatt
aatgaaagcacagaagcccagagtgaacagaaagagcagaagtcgagtgagagagtgaaa
cagattagtgcacaggatgaactgtag