GENSCAN 1.0	Date run: 6-Nov-116	Time: 15:00:33

Sequence gi568815587r:107404661_107660893 : 256233 bp : 37.64% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 Intr -  11638  11549   90  1  0   48  107    31 0.262   0.27
 1.02 Intr -  13627  13534   94  0  1   46   95   110 0.532   6.55
 1.01 Init -  18920  18874   47  2  2   71   87    45 0.809   2.91
 1.00 Prom -  22276  22237   40                              -4.15

 2.09 PlyA -  22345  22340    6                               1.05
 2.08 Term -  24791  24120  672  2  0   61   37   412 0.912  26.16
 2.07 Intr -  36962  36843  120  2  0   99  110    96 0.999  12.67
 2.06 Intr -  38389  38279  111  1  0   39  121   110 0.978   9.06
 2.05 Intr -  49490  49374  117  2  0   42   84    53 0.412   0.04
 2.04 Intr -  51116  51006  111  2  0   92   68   167 0.977  14.76
 2.03 Intr -  53163  53052  112  2  1   34   82   173 0.299  10.76
 2.02 Intr -  61206  61019  188  0  2   62   71   162 0.549   9.47
 2.01 Init -  64083  63784  300  0  0   46   10   171 0.340   2.40
 2.00 Prom -  64821  64782   40                              -2.75

 3.05 PlyA -  65355  65350    6                               1.05
 3.04 Term -  77988  77650  339  0  0   42   48   147 0.051  -0.35
 3.03 Intr -  89721  89602  120  0  0  128   47    35 0.214   3.17
 3.02 Intr -  90313  90207  107  2  2  -12   71   156 0.068   2.91
 3.01 Init -  93229  93112  118  1  1   79   89    96 0.091   9.21
 3.00 Prom -  94493  94454   40                              -5.15

 4.16 PlyA -  94572  94567    6                               1.05
 4.15 Term - 100555  99998  558  1  0   81   38   407 0.303  28.36
 4.14 Intr - 106376 106227  150  2  0   83   92   115 0.986  10.84
 4.13 Intr - 112677 112560  118  2  1   73   83   -10 0.078  -3.45
 4.12 Intr - 115214 115190   25  0  1  104   92    17 0.090  -0.13
 4.11 Intr - 117895 117635  261  2  0   18   34   210 0.204   5.14
 4.10 Intr - 120932 120781  152  1  2   73   86    99 0.852   7.09
 4.09 Intr - 127746 127640  107  0  2   65   71    98 0.035   3.89
 4.08 Intr - 145163 145093   71  0  2   98   84    45 0.900   3.08
 4.07 Intr - 147252 147148  105  1  0   63  113    76 0.974   6.87
 4.06 Intr - 148543 148448   96  2  0  102   98    82 0.993   9.66
 4.05 Intr - 149318 149187  132  0  0   77   93   115 0.999  10.60
 4.04 Intr - 152343 152106  238  0  1   71   62   271 0.999  19.06
 4.03 Intr - 156239 156105  135  2  0  110   60    95 0.923   8.74
 4.02 Intr - 156378 156369   10  2  1   82   94     9 0.479  -5.76
 4.01 Init - 161043 160970   74  0  2   98   81    52 0.814   6.09
 4.00 Prom - 165047 165008   40                              -5.95

 5.11 PlyA - 165449 165444    6                               1.05
 5.10 Term - 165682 165524  159  1  0  -15   52   217 0.173   4.76
 5.09 Intr - 172775 172628  148  1  1   80   82    33 0.131   1.22
 5.08 Intr - 174340 174226  115  2  1   98   26   109 0.341   4.29
 5.07 Intr - 178478 178319  160  2  1   73  100   109 0.907   9.24
 5.06 Intr - 182225 182093  133  1  1   45   60    99 0.351   2.53
 5.05 Intr - 186676 186242  435  0  0   80   21   173 0.018   1.57
 5.04 Intr - 186794 186734   61  0  1   88   66    25 0.081  -2.83
 5.03 Intr - 187389 187159  231  1  0   82  -29   215 0.075   6.02
 5.02 Intr - 187882 187678  205  1  1   67   20   131 0.762   2.05
 5.01 Init - 197350 197258   93  1  0   92   73    66 0.324   5.93
 5.00 Prom - 219032 218993   40                              -4.65

 6.00 Prom + 222949 222988   40                              -4.35
 6.01 Init + 225893 225902   10  1  1   75  111     4 0.283   1.83
 6.02 Intr + 226040 226068   29  0  2   98  115     8 0.815   1.42
 6.03 Intr + 229224 229331  108  2  0   13   86   115 0.589   3.36
 6.04 Intr + 230995 231105  111  0  0   56   96    94 0.839   6.66
 6.05 Intr + 242808 242941  134  2  2   87   92   135 0.994  12.32
 6.06 Intr + 245675 245743   69  2  0   83   91   106 0.889   7.78
 6.07 Term + 247110 247296  187  0  1   31   51    96 0.539  -3.82
 6.08 PlyA + 247403 247408    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  90326  90207  120  2  0   58   71   165 0.888  12.04
S.002 Term - 143643 143485  159  0  0   13   52   152 0.814   1.06

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:107404661_107660893|GENSCAN_predicted_peptide_1|77_aa
MEELDITLVGKKNLVSSPERESIHILSVDEKNKLGAKIIKAEMMGNMELAEQLKVQLEKA
NKFKETITQIPKKSGVE

>gi568815587r:107404661_107660893|GENSCAN_predicted_CDS_1|231_bp
atggaggaactagacataactttagttggaaagaaaaacctggtcagcagtccagagcgt
gagtccattcacatcctgagtgttgatgagaagaacaagttgggagccaagattatcaaa
gcagagatgatggggaatatggaattagctgaacaacttaaagttcaacttgaaaaggca
aataaattcaaagaaactataacacagataccaaaaaaatctggggtagag

>gi568815587r:107404661_107660893|GENSCAN_predicted_peptide_2|576_aa
MRKYSRGWWSHKESGPLDGGEAPSPVSDYPPMGGPLANSVHTCLIYLLYSGVPLLQQLSL
YPTKYSDLAEERKSGVGTNIPSLWNPSDSSEEVVFSQVLKPPLMIPRQTGSGVDLQQTLA
DLQQRGLTVRRKTNKQKGTASTPTKRTSTQKPHLKVTNIKDQSFTMATSMAAASGRFESA
KSIEERKEQTRNARAEVLRQAKANFEKEERRKELKRLRGEDTWMLPDVNERIEQFSQALG
LCNQQVMKPARPASFPSGWQVPPGLRVGPEVPSGTQSSEDEWVEAVPSQTPDKEKAWKVK
DEKSGKDDTQIIKRDEWMTVDFMSVKTVSSSSLKAEKETMRKIEQEKNQALEQSMEIFQS
KLEDAEKAASTKEDYRRERWRKPTYSDKAQNCQESRESDLVKYGNSSRDRYATTDTAKNS
NNEKFIGDEKDKRPGSLETCRRESNPRQNQEFSFGNLRAKFLRPSDDEELSFHSKGRKFE
PLSSSSALVAQGSLCSGFRKPTKNSEERLTSWSRSDGRGDKKHSNQKPSETSTDEHQHVP
EDPREKSQDEVLRDDPPKKEHLRDTKSTFAGYFIIF

>gi568815587r:107404661_107660893|GENSCAN_predicted_CDS_2|1731_bp
atgaggaaatactctagaggctggtggagccacaaggaatctgggcccctggatggtgga
gaagcgccatcacctgtctctgactatcctcctatgggtggaccattagccaacagtgtt
cacacctgtctcatttacctgctgtattctggggttcctctgttacagcagcttagccta
tacccgactaaatacagtgatttagcagaggaaaggaaaagtggtgtgggaacaaatatt
ccatccctgtggaatccgtcagattcatctgaggaagtggtttttagccaagtcttgaaa
cctccgctgatgatacccaggcaaacagggtctggagtggacctccagcaaactctagca
gacctgcagcagaggggcctgactgttagaaggaaaactaacaaacagaaaggaacagca
tcaacaccaacaaaaaggacgtccactcagaaaccccatctgaaggtcaccaatatcaaa
gaccaaagctttacgatggcaacaagtatggcggctgctagtggtagatttgaaagtgcg
aagagtatcgaagagcggaaagaacagacccggaatgccagggccgaggtgttgcgccag
gctaaagccaattttgaaaaagaagaaaggcgtaaagaacttaagcgacttcggggtgag
gatacatggatgctacctgatgtgaatgagagaattgaacagttctcacaggccctgggg
ctctgcaatcagcaggtgatgaagccagccaggcctgcgtctttcccttcagggtggcaa
gttcccccaggcctgagggtgggtccagaggtgccatcagggacccagagctctgaagat
gagtgggttgaggctgttccatcccagactcctgacaaggaaaaagcctggaaagtgaaa
gatgaaaagtcaggaaaagatgacacccaaattatcaagagggatgagtggatgactgtt
gattttatgtctgttaaaactgtgtcatcatcatcactcaaagctgaaaaggaaactatg
aggaaaatagagcaagagaaaaaccaagcgcttgaacagtcaatggaaatatttcagtca
aaattagaagatgctgaaaaagctgcatccacgaaagaagattatagacgggaacggtgg
aggaaacccacatattcagataaagcacaaaattgtcaagaaagtagagaatcagactta
gtaaaatatggtaacagttcaagggatagatatgctacaacagatactgcaaaaaatagc
aataatgaaaaatttattggtgatgaaaaagataagagacctgggtctttagaaacgtgt
agaagagaatctaacccaaggcaaaatcaagagttttcttttggcaatttgagagctaaa
ttcttgagaccctctgatgatgaagaactgtcatttcacagcaagggcagaaaatttgaa
ccacttagttcatcttcagcattggtagctcagggctctttgtgtagtggttttagaaaa
cccaccaagaacagtgaagaaagattaacatcatggagtcgctctgatgggagaggagac
aagaaacattcaaatcaaaagccatcggaaaccagtactgatgaacaccaacatgttcca
gaagacccaagagaaaaatcacaagatgaagtcttgagagatgaccctccaaaaaaagaa
catctacgggatacaaagtctacatttgctgggtattttatcattttttaa

>gi568815587r:107404661_107660893|GENSCAN_predicted_peptide_3|227_aa
MQTFAVSVTALKGGASGVVCSSRWVRGLADFRNEATDPCARGKAQCKGPRGKACDGEPQE
ASVAGAESATVVGDQAWVYHACPEQISLVIHLQTNLAITLSSDIVCTAEQLDVQPVALDH
CPCQWNTMSAESLMRQIWFENSEPNEFLQKPSEMRSATSDLSASQRRNTKQVHLPQFRRR
RFAVTTEHTSGQPATCSSGKPTDVVLKKQLSQLKRKCPGHKSVIYTG

>gi568815587r:107404661_107660893|GENSCAN_predicted_CDS_3|684_bp
atgcagaccttcgcagtgagtgttacagctcttaaaggtggtgcatctggagttgtttgt
tcctcccggtgggttcgtggtctcgctgacttcaggaatgaagccacagacccttgtgcc
agaggaaaagcccagtgcaaaggcccacggggaaaggcctgcgatggagaaccgcaggag
gccagtgtggctggagcagagtcagcgacagtggttggcgaccaggcatgggtctaccat
gcctgccctgagcaaatctccctggtaatacatctgcaaacaaacctggcaattactttg
agttcagatattgtatgcactgctgaacagctagatgtgcagccagtggccttagatcat
tgcccttgtcaatggaacacaatgtcagcagagtctcttatgaggcagatttggtttgaa
aactctgagccaaatgagttcctccaaaagccttctgaaatgagaagtgctacaagtgac
ctctctgcttcccagaggagaaacacaaagcaagtgcacctgccacaattcagaaggagg
aggtttgcagtgaccactgagcatacttctggtcaacctgctacctgctcatctggaaaa
ccaacagatgtggttttaaagaagcaactttcacaattaaaaagaaagtgccctgggcac
aagtctgtgatctatactggctga

>gi568815587r:107404661_107660893|GENSCAN_predicted_peptide_4|743_aa
MACKYPLRCSGARVERLAKKKAHACWYKFAMDSNHQSNYKLSKTEKKFLRKQIKAKHTLL
RHEGIETVSYATQSLVVANGGLGNGVSRNQLLPVLEKCGLVDALLMPPNKPYSFARYRTT
EESKRAYVTLNGKEVVDDLGQKITLYLNFVEKVQWKELRPQALPPGLMVVEEIISSEEEK
MLLESVDWTEDTDNQNSQKSLKHRRVKHFGYEFHYENNNVDKDKPLSGGLPDICESFLEK
WLRKGYIKHKPDQMTINQYEPGQGIPAHIDTHSAFEDEIVSLSLGSEIVMDFKHPDGIAV
PVMLPRRSLLVMTGESRYLWTHGITCRKFDTVQASESLKSGIITSDVGDLTLSKRGLRTS
FTFRKVRQTPCNCSYPLVCDSQRKETPPSFPESDKEASRLEQEYVHQVYEEIAGHFSSTR
HTPWPHIVEFLKALPSGSIVADIGCGNGKYLGINKELYMANEETEALRYGCPNLHSHQQY
ARIPFSPHPQRRLLSFVSLIMARSNKCEIGCDRSQNLVDICRERQFQAFVCDALAVPVRS
GSCDACISIAVIHHFATAERRVAALQEIVRLLRPGGKALIYVWAMEQEYNKQKSKYLRGN
RNSQGKKEEMNSDTSVQRSLVEQMRDMGSRDSASSVPRINDSQEGGCNSRQVSNSKLPVH
VNRTSFYSQDVLVPWHLKGNPDKGKPVEPFGPIGSQDPSPVFHRYYHVFREGELEGACRT
VSDVRILQSYYDQGNWCVILQKA

>gi568815587r:107404661_107660893|GENSCAN_predicted_CDS_4|2232_bp
atggcgtgcaagtatccgctgcggtgttctggtgctagagtggagaggctggcaaagaag
aaggcacacgcatgttggtacaagtttgctatggacagcaaccatcaaagtaattacaaa
ctcagtaaaactgagaagaagttcttaaggaaacagattaaagccaagcatactttgctg
agacatgaaggcattgagacagtatcctatgccactcagagcctggttgttgccaatggt
ggtttgggtaatggtgtgagtcggaaccagctgctcccggttttagagaaatgtggactg
gtggatgctctcttaatgccacctaacaagccgtactcatttgcaagatacagaactaca
gaagaatctaagagagcctatgttaccctcaatggaaaagaagtagtggatgatttagga
caaaagatcactctgtatttgaattttgtggaaaaagtgcagtggaaggagttgaggcct
caagccttaccaccaggactcatggtagtagaagaaataatttcttctgaggaggagaaa
atgcttttggaaagtgttgattggacagaagatacagacaatcaaaactctcaaaaatcc
ttaaaacacagaagagtaaagcattttggttatgagttccactatgagaacaacaatgta
gataaagataagccattatctgggggtcttcctgacatttgtgaaagctttttggagaaa
tggttgaggaaaggttacattaaacataaacctgatcaaatgaccataaatcagtatgaa
cctgggcaaggaattcccgctcatattgatacacattccgcttttgaggatgagatcgtt
tctctcagtttggggtcagagattgtcatggattttaagcacccagatggcattgcagtg
ccagttatgttgcctcgtcggagtttgctggtgatgacaggagaatctagatacctttgg
acccatggaatcacgtgcagaaaatttgatactgttcaagcatctgagagtcttaaaagt
ggaattatcaccagtgatgttggagacttaactttaagcaagaggggactacgaacatca
tttacatttaggaaagtgaggcaaacaccttgtaactgtagttacccgttggtctgtgat
agccagaggaaagagactcccccctcatttccagagagtgataaagaagcctcacggctg
gagcaagagtacgtccatcaggtttatgaagagattgctgggcacttcagcagcacaaga
cataccccttggccgcacattgtggagtttttgaaggctttgccaagtggttcaatagtg
gctgatattggatgtggtaatggaaagtatcttggcatcaataaggagttatatatggca
aatgaggaaacagaggctttgagatatggctgtcctaatttacattcccaccaacagtat
gccaggattcccttttccccacatcctcaacgccgtttgttatcttttgtctctttgata
atggccagatctaacaagtgtgagattggttgtgatcgtagccaaaaccttgtggacatt
tgtagagagaggcaatttcaggcttttgtctgtgatgcattggcagtaccagtccgcagt
gggtcttgtgatgcctgcatctccattgctgttattcatcattttgcaacagcagagcgt
agagtggcagctctccaagaaattgttcgactcctgagaccaggtgggaaggcactcatt
tatgtctgggcaatggaacaagaatataataagcagaagtccaagtatcttagaggaaac
agaaatagccaaggaaagaaagaggagatgaacagtgatacctcagtgcagaggtcactt
gtggagcaaatgcgtgacatgggcagtcgagactcggcatcttctgtcccccgcattaat
gactctcaggaaggaggatgtaattcaaggcaagtttctaattccaagctgcctgttcat
gttaacaggacttctttttattctcaagatgtactggttccctggcaccttaagggaaat
cctgataaaggcaaacctgttgagccatttggtcccataggatcccaggacccaagtcct
gtgtttcatcgttactaccatgtgttccgtgagggagaactggaaggtgcctgcaggact
gtgagtgatgtcagaattctgcaaagctactacgatcaaggaaactggtgtgtgattctt
caaaaggcctga

>gi568815587r:107404661_107660893|GENSCAN_predicted_peptide_5|579_aa
MEMESEKDIGPRNQKIPYWRKVKGFLRMMANVNELRHLSQYKHTTCFPPYWQHQLRTVVK
PPAQNVYIAEMQNNAWMPAQNWSLQMTDSIDSTNLRPTGAPSAVSCHSPLGCPILPARER
DAPILKCGVSQQLEEAQLNSPADRSSQLAARRTPQSGGPAARLPSSSLETEAHKARGHSP
LLPRLLWGRQSPGLWRRRSGYALRGGERRARSRAPRLDRGGRGCRTGCGRGRGGGGAGIW
VAWSGASRDAPGAAGGEEATESSGGGRAEASLGARSGRLLSMPSPGAGGAGAVLGERARH
PVPESGGKGLNLRTPLLWEVAGVAQFNCSRVASLEEPTTGRWSKNFVNAGAQAPLLSEKS
KGVELTSKRWLLLQVPKSKKLMSENRIKVLSYSSLSYVKVDRYPKIRKEEDEHPREKQKL
QQMPSHSQHPSAESASSERSPLKAITSMAFSLETSPYFGVLRAKGVLVMYSMALGPSRRS
RFIKTGELQWRKSNSPEPAVPETRVLLLLKSVSLSIQGSEFLKIIWWADTQAAGRPEDVQ
GSTPAEGHTNRHRQQTQAGRQAFNGQNYAEFGHGGWRRA

>gi568815587r:107404661_107660893|GENSCAN_predicted_CDS_5|1740_bp
atggagatggagagtgagaaagacattggacccaggaatcagaaaatcccatactggaga
aaggtgaaaggatttcttagaatgatggcaaatgttaatgaattaagacatttatcccaa
tacaagcatactacctgctttccaccatattggcagcaccagttaaggacggttgtcaag
ccaccggcacaaaacgtatacattgcagagatgcagaacaatgcctggatgcctgcccaa
aattggtctctccaaatgacagactcaattgatagtaccaatctacggcccactggggcc
ccgtcagctgtcagctgtcactccccactcggctgtccaattctacctgcccgggaaaga
gatgctccaatattgaagtgcggagtttcccagcagttggaggaggcgcagctcaacagc
ccagccgaccgctcctcccagctcgctgcgcggaggaccccacagtccggcggccccgca
gcccggctgcctagctcctccctggagaccgaagcccacaaggcaagggggcattcgccc
ctactccctaggctcctctggggtcgccaatcacctggactctggcggcggcgctccggc
tacgcgctccgcggcggcgagcgacgcgctcggtcccgggcgccgaggctggacaggggc
ggccgcggctgccgcactggctgcgggcgcggaaggggcgggggcggggcaggaatctgg
gtcgcgtggagcggagcaagccgggatgcacccggggcggctggcggcgaggaggcgacc
gagagcagcgggggagggcgtgcagaggcttccctgggagcacgaagcgggcggctactg
agcatgcccagtccgggagccgggggcgcgggcgcggtgctgggggagagagcgcggcac
ccagtgccggaatctggggggaaagggttaaatctgcggaccccgctgctgtgggaggtg
gctggtgtggcccagtttaattgctcccgggtggcctccctggaggagcccacgacagga
agatggtcaaaaaattttgtgaatgctggtgctcaggccccactcctttcagagaagtcc
aagggagttgaattaacatctaaaagatggctgctcctgcaggtacctaagagcaagaaa
ctgatgtcagaaaacaggatcaaagttctgagttactcgagtctctcatatgtgaaagta
gacaggtatcctaaaataagaaaggaggaggacgaacacccaagagaaaaacagaaatta
caacaaatgccatcacactctcaacatcccagtgcagagtcagcttcaagtgagagaagt
cctttgaaggctattacttctatggctttcagtttggagaccagtccctacttcggcgtt
ttgagagcaaaaggtgttctcgtcatgtattccatggctttagggccttctaggcggagc
cgatttatcaagacaggggaattgcaatggagaaagagtaattcaccagagccggctgtg
ccggagaccagagttttattactactcaaatcagtctccctgagcattcagggatcagag
tttttaaagataatttggtgggcagacacacaagcggctggacgtccagaggacgtccag
gggagcacaccggcggaagggcacaccaacagacacaggcagcagacacaggcaggtcgg
caggccttcaacgggcagaactatgcagagtttggccacggcggctggaggagagcctga

>gi568815587r:107404661_107660893|GENSCAN_predicted_peptide_6|215_aa
MKIETSLRDSKSKLFGAMVTSNDESHSLNMTLLKFSIPRLSDCPVRLEQACLLQIVGYRN
LIADVEKLRREAYDSDNPQHEEMLLKLWKFLKPNTPLESRISKQWCEIGFQGDDPKTDFR
GMGLLGLYNLQYFAERDATAAQQVLSDSLHPKCRWHILEEHQSEDTSPPQSLESDNKLAI
FLINHWRGTDFVYESFLKKSFIVVSPVSAIQEKVF

>gi568815587r:107404661_107660893|GENSCAN_predicted_CDS_6|648_bp
atgaaaatcgaaacatcactgagggattctaaaagtaagttgtttggtgctatggtaaca
agtaatgatgaaagtcattcactgaatatgactttattaaagttcagtattcctaggctg
tctgactgtcctgtacgtttggagcaggcttgccttctgcaaatcgttgggtacaggaac
cttattgcagatgtggaaaaactgcgtagagaggcctatgattctgataatccccaacat
gaagaaatgcttttgaagttatggaaattcttgaagcccaatactccactggaatctcgg
atttctaagcagtggtgtgaaattggtttccaaggtgatgatcctaaaacagactttcga
ggaatgggacttctgggactgtacaatttgcagtatttcgcggaaagggatgccacagca
gctcagcaggtcctgtctgactctcttcatccgaaatgcaggtggcatatcctagaggaa
catcagtcagaagacacgagtcctccgcagtccttggaaagtgacaataagctcgcaata
tttctgattaaccactggagggggactgactttgtctatgagtcattcttaaagaaaagc
ttcatagttgtatctcctgtgtcagccatccaagagaaagtattttga