GENSCAN 1.0 Date run: 8-Nov-116 Time: 01:23:02 Sequence gi568815595f:148727824_148959999 : 232176 bp : 37.99% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 306 368 63 2 0 73 83 23 0.302 1.50 1.02 Intr + 2237 2414 178 1 1 -23 84 136 0.351 0.67 1.03 Intr + 9710 9775 66 0 0 74 78 56 0.509 1.16 1.04 Intr + 13166 14244 1079 0 2 92 75 380 0.602 26.05 1.05 Intr + 19161 19269 109 2 1 85 79 101 0.576 7.84 1.06 Term + 20614 20765 152 2 2 32 42 191 0.577 5.99 1.07 PlyA + 22056 22061 6 1.05 2.05 PlyA - 22177 22172 6 1.05 2.04 Term - 24135 23957 179 1 2 60 49 119 0.153 2.07 2.03 Intr - 28241 28181 61 2 1 93 76 34 0.727 0.09 2.02 Intr - 29584 29472 113 2 2 54 72 137 0.979 7.88 2.01 Init - 32296 32242 55 1 1 59 97 16 0.977 1.10 2.00 Prom - 36548 36509 40 -7.95 3.04 PlyA - 38132 38127 6 1.05 3.03 Term - 38793 38462 332 1 2 89 49 256 0.958 15.63 3.02 Intr - 55634 54981 654 0 0 56 80 260 0.310 12.91 3.01 Init - 57644 57434 211 2 1 81 0 220 0.231 11.49 3.00 Prom - 66295 66256 40 -4.65 4.00 Prom + 74539 74578 40 -3.65 4.01 Init + 81271 81343 73 0 1 61 97 18 0.247 1.28 4.02 Term + 83025 83260 236 1 2 6 48 235 0.722 6.80 4.03 PlyA + 87038 87043 6 1.05 5.00 Prom + 95755 95794 40 -5.35 5.01 Init + 100001 100071 71 1 2 64 110 79 0.998 8.35 5.02 Intr + 100179 100254 76 0 1 58 111 87 0.933 6.60 5.03 Intr + 106675 106799 125 0 2 55 108 78 0.912 4.96 5.04 Intr + 111752 111938 187 2 1 45 31 100 0.510 -1.23 5.05 Intr + 112693 112962 270 0 0 42 72 207 0.965 11.32 5.06 Intr + 114000 114101 102 2 0 83 78 32 0.645 1.15 5.07 Intr + 116655 116765 111 2 0 37 71 151 0.989 7.96 5.08 Intr + 116854 116944 91 0 1 1 103 72 0.931 -1.35 5.09 Intr + 117601 117803 203 2 2 57 111 243 0.970 21.58 5.10 Intr + 124480 124545 66 0 0 91 75 31 0.442 0.28 5.11 Intr + 128833 128965 133 0 1 51 113 44 0.762 2.50 5.12 Term + 131992 132179 188 2 2 59 33 209 0.993 9.17 5.13 PlyA + 132336 132341 6 1.05 6.00 Prom + 134882 134921 40 -8.55 6.01 Init + 137485 137552 68 0 2 88 99 81 0.999 10.05 6.02 Intr + 137650 137725 76 1 1 61 99 106 0.997 7.60 6.03 Intr + 141092 141216 125 1 2 72 80 64 0.657 2.46 6.04 Intr + 150618 150720 103 0 1 67 87 82 0.806 5.26 6.05 Intr + 150824 150925 102 1 0 54 93 67 0.852 3.25 6.06 Intr + 151965 152066 102 2 0 30 111 51 0.669 1.05 6.07 Intr + 153699 153809 111 2 0 83 79 55 0.920 3.76 6.08 Intr + 154682 154772 91 1 1 46 74 49 0.984 -2.05 6.09 Intr + 155790 155992 203 1 2 86 84 196 0.983 17.08 6.10 Intr + 158270 158354 85 1 1 112 78 138 0.999 13.67 6.11 Term + 168697 168884 188 2 2 108 43 131 0.948 7.27 6.12 PlyA + 169234 169239 6 1.05 7.00 Prom + 175331 175370 40 -7.25 7.01 Init + 180214 180365 152 0 2 92 38 110 0.221 5.96 7.02 Intr + 191525 191642 118 2 1 31 82 41 0.009 -2.65 7.03 Intr + 204304 204384 81 0 0 91 93 9 0.114 0.62 7.04 Intr + 212946 213110 165 2 0 125 96 71 0.579 10.84 7.05 Term + 213161 213310 150 1 0 90 42 134 0.863 5.93 7.06 PlyA + 215106 215111 6 1.05 8.02 PlyA - 216208 216203 6 1.05 8.01 Term - 226028 225917 112 1 1 78 44 151 0.867 6.75 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:148727824_148959999|GENSCAN_predicted_peptide_1|548_aa MPKSKNNNNRALKHYLQNVKNHLQIHDLIRCFQQCFAPNWTHACYFIILIVNIIIIIIIF QMKKMNHKSTDSPKAPQLRGDLNCVEEENCDFIPLASWTPTHGVFDIVFATNSTQVIKMI LNSSTEDGIKRIQDDCPKAGRHNYIFVMIPTLYSIIFVVGIFGNSLVVIVIYFYMKLKTV ASVFLLNLALADLCFLLTLPLWAVYTAMEYRWPFGNYLCKIASASVSFNLYASVFLLTCL SIDRYLAIVHPMKSRLRRTMLVAKVTCIIIWLLAGLASLPAIIHRNVFFIENTNITVCAF HYESQNSTLPIGLGLTKNILGFLFPFLIILTSYTLIWKALKKAYEIQKNKPRNDDIFKII MAIVLFFFFSWIPHQIFTFLDVLIQLGIIRDCRIADIVDTAMPITICIAYFNNCLNPLFY GFLGKKFKRYFLQLLKYIPPKAKSHSNLSTKMSTLSYRPSDNATVNWSRLEVDQSQFFMR IFQTGADEELAPVLAFLRHRKQLVLLNRDKPPRAHHREESIMSADATADLLRISQRHVTQ EPTDADSS >gi568815595f:148727824_148959999|GENSCAN_predicted_CDS_1|1647_bp atgccaaaaagtaaaaataataataatagagccttgaaacattacttacagaatgtcaaa aaccacctccagatccatgatctcattcggtgtttccaacaatgttttgcaccaaactgg acacatgcttgctacttcatcatcctcatcgtgaacattattattattatcatcattttc cagatgaagaaaatgaatcacaagtcaactgacagtccaaaggctccacagctcagagga gacttaaactgtgtagaggaggagaactgtgactttattccccttgcatcctggacaccc acacatggtgtatttgatatagtgtttgcaacaaattcgacccaggtgatcaaaatgatt ctcaactcttctactgaagatggtattaaaagaatccaagatgattgtcccaaagctgga aggcataattacatatttgtcatgattcctactttatacagtatcatctttgtggtggga atatttggaaacagcttggtggtgatagtcatttacttttatatgaagctgaagactgtg gccagtgtttttcttttgaatttagcactggctgacttatgctttttactgactttgcca ctatgggctgtctacacagctatggaataccgctggccctttggcaattacctatgtaag attgcttcagccagcgtcagtttcaacctgtacgctagtgtgtttctactcacgtgtctc agcattgatcgatacctggctattgttcacccaatgaagtcccgccttcgacgcacaatg cttgtagccaaagtcacctgcatcatcatttggctgctggcaggcttggccagtttgcca gctataatccatcgaaatgtatttttcattgagaacaccaatattacagtttgtgctttc cattatgagtcccaaaattcaaccctcccgatagggctgggcctgaccaaaaatatactg ggtttcctgtttccttttctgatcattcttacaagttatactcttatttggaaggcccta aagaaggcttatgaaattcagaagaacaaaccaagaaatgatgatatttttaagataatt atggcaattgtgcttttctttttcttttcctggattccccaccaaatattcacttttctg gatgtattgattcaactaggcatcatacgtgactgtagaattgcagatattgtggacacg gccatgcctatcaccatttgtatagcttattttaacaattgcctgaatcctcttttttat ggctttctggggaaaaaatttaaaagatattttctccagcttctaaaatatattccccca aaagccaaatcccactcaaacctttcaacaaaaatgagcacgctttcctaccgcccctca gataatgccacagttaattggtccaggcttgaagtggaccaatcacagtttttcatgagg atttttcaaacaggcgctgatgaagagctggccccagtgttggcatttctgagacataga aagcagctggtgttactcaacagagataagcctcctagagctcaccacagagaggagagc atcatgtctgcagatgccacagcagatctgctgagaatcagccagcgccatgtgacacaa gaacccacagatgcggattcgtcctga >gi568815595f:148727824_148959999|GENSCAN_predicted_peptide_2|135_aa MYGGYVCVEEGQQWEVVFVWPTNQQQENLLAACKKYTLLGSVLDLLAESAASQDSQCWYS PTSGKQETEEVQNKGRLLDTLTSPSFIELVSCSWVLRRTFREQPTKNQAEIKNDAYKRSL CDSMSLSTVQREKFP >gi568815595f:148727824_148959999|GENSCAN_predicted_CDS_2|408_bp atgtatggaggttatgtgtgtgtggaggaggggcagcagtgggaggtggtttttgtgtgg cccacgaatcagcaacaagagaatctcctggcagcttgtaagaaatacacactcttaggc tctgtcctagacctactagcggaatcagcagcctcacaagattcccagtgttggtactca cctacatcaggtaaacaggagacagaagaggtacaaaacaaggggagattgctggatacc cttacatcaccaagcttcattgaattggtgtcatgctcttgggttttaagaagaacattc cgagaacaaccaactaagaatcaggctgaaataaaaaatgatgcttacaaacgatccctc tgtgactctatgtctctatccacagtccaacgtgagaaatttccttag >gi568815595f:148727824_148959999|GENSCAN_predicted_peptide_3|398_aa MAQELRDTCTSFSSQFNQVEERVLVIEDQMNEMKREEKFGEKRVKRNEQSLQEIWNYVKR PNLRLIGVTEKILITIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESLDRPITG SEIEAIINSLPTKKSPGPDRFTAEFYQRYNEELVAFLLKLFQSTEKEGILPNSFYEASII LIPKPGRDTTIKENFRLISLMNMDTKILNKILANRIQQHIKKLIHHDQVGFIPGVQDWFN ICKSINIIHHINRTKDKNHMIISIDAEKAFDKMQQPFMLKTLNKLGTDALITYIRITSVE KSLNDLKELKTMARELPDTCISFRSQFDQLEERVSVIEDQMNEMKREEKFREKRVKRNKQ SLQEIWSYVKRPNLRLIGVPESDGENGTKLENTLQDII >gi568815595f:148727824_148959999|GENSCAN_predicted_CDS_3|1197_bp atggcacaagaactacgtgacacatgcacaagcttcagtagccaattcaatcaagtggaa gaaagggtattagtgattgaagatcaaatgaatgaaatgaagcgagaagagaaatttgga gaaaaaagagtaaaaagaaacgaacaaagcctccaagaaatatggaactatgtgaaaaga ccaaatctacgtctgattggcgtaactgaaaaaatactaattaccatcagagaatactat aaacacctctatgcaaataaactagaaaatctggaagaaatggataaattcctcgacaca tacaccctcccaagactaaaccaggaagaagttgaatcgctggatagaccaataacaggt tctgaaattgaggcaataattaatagcctaccaaccaaaaaaagtccaggaccagataga ttcacagctgaattctaccagaggtacaacgaggagctggtagcattccttctgaaacta ttccaatcaacagaaaaagagggaatcctccctaactcattttatgaggccagcatcatc ctgataccaaagcctggcagagacacaacaataaaagagaattttagactaatatccctg atgaacatggacacaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatc aaaaagcttatccaccacgatcaagttggcttcatccctggggtgcaagactggttcaat atatgcaaatcaataaacataatccatcatataaacagaaccaaagacaaaaatcacatg attatctcaatagatgcagaaaaggcctttgacaaaatgcaacagcccttcatgctaaaa actctcaataaactaggtactgatgcacttatcacttatattagaataaccagtgtagag aagtccttaaatgacctgaaggagctgaaaaccatggcacgagaactacctgacacatgc ataagcttcaggagccaatttgatcaactggaagaaagggtatcagtgattgaagatcaa atgaatgaaatgaagcgagaagagaagtttagagaaaaaagagtaaaaagaaacaaacaa agcctccaagaaatatggagctatgtgaaaagaccaaatctacgtctgattggtgtacct gaaagtgatggggagaatggaaccaagttggaaaacactctgcaggatattatctag >gi568815595f:148727824_148959999|GENSCAN_predicted_peptide_4|102_aa MTKEKDWDSLGTATWERQPTKKWLRWGIAEKIPQNVKATLELGNRQRGWNSLEGSEEDRK MWKSLEPPRDLLNGFDKHADSDMNNKVQAEVVSDGDEELVRN >gi568815595f:148727824_148959999|GENSCAN_predicted_CDS_4|309_bp atgacaaaggaaaaggactgggactctctagggacagcaacttgggaaagacaaccaaca aaaaaatggttaaggtggggcattgccgaaaagataccccaaaatgtgaaagcgactttg gaactgggtaacaggcagagaggttggaacagtttggagggctcagaagaagataggaaa atgtggaagagtttggaacctcctagagacttgttgaatggctttgacaaacatgctgat agtgatatgaacaataaggttcaggctgaggtggtctcagatggagatgaggaacttgtt aggaactga >gi568815595f:148727824_148959999|GENSCAN_predicted_peptide_5|540_aa MLALLVLVTVALASAHHGGEHFEGEKVFRVNVEDENHINIIRELASTTQIDFWKPDSVTQ IKPHSTVDFRVKAEDTVTVENVLKQNELQYKKQIPVSINLLHFPKTDTSSYPSSPIEGVT RRKVQRMSLTYFIRMSLGQPEEKQTFDVCLKFSNLSGAAEGHQEEPEDNMRIYGECNVPS TKAVALGFYVCSLEKNTLMFIGTPTLVRCRVLISNLRNVVEAQFDSRVRATGHSYEKYNK WETVGKAGQNKPAIFMDCGFHAREWISPAFCQWFVREAVRTYGREIQVTELLDKLDFYVL PVLNIDGYIYTWTKSRFWRKTRSTHTGSSCIGTDPNRNFDAGWCEIGASRNPCDETYCGP AAESEKETKALADFIRNKLSSIKAYLTIHSYSQMMIYPYSYAYKLGENNAELMTFGNVWE TSLVVITVELYWHLAVCQWSFTTEFLSAESLSLGVGCRELEGNGSYEKVMMLLRKKITHP AAGGSDDWAYDQGIRYSFTFELRDTGRYGFLLPESQIRATCEETFLAIKYVASYVLEHLY >gi568815595f:148727824_148959999|GENSCAN_predicted_CDS_5|1623_bp atgttggcactcttggttctggtgactgtggccctggcatctgctcatcatggtggtgag cactttgaaggcgagaaggtgttccgtgttaacgttgaagatgaaaatcacattaacata atccgcgagttggccagcacgacccagattgacttctggaagccagattctgtcacacaa atcaaacctcacagtacagttgacttccgtgttaaagcagaagatactgtcactgtggag aatgttctaaagcagaatgaactacaatacaagaaacagattccagttagtattaattta ttacactttccaaaaacagacacaagttcgtacccaagttcaccaattgaaggagtgacc aggaggaaggttcagaggatgtccctgacatacttcatcagaatgtccttggggcagcca gaggagaagcagacttttgatgtgtgtctgaaattctctaatctctctggagcagctgaa ggtcatcaggaagaaccagaggacaacatgagaatttatggagagtgcaatgtgccatct actaaagcagtggctctggggttttatgtgtgttcactggagaaaaatacacttatgttc ataggaacacccactttggttcgttgcagggtactgataagcaacctgagaaatgtggtg gaggctcagtttgatagccgggttcgtgcaacaggacacagttatgagaagtacaacaag tgggaaacggttggcaaagctggacaaaataagcctgccattttcatggactgtggtttc catgccagagagtggatttctcctgcattctgccagtggtttgtaagagaggctgttcgt acctatggacgtgagatccaagtgacagagcttctcgacaagttagacttttatgtcctg cctgtgctcaatattgatggctacatctacacctggaccaagagccgattttggagaaag actcgctccacccatactggatctagctgcattggcacagaccccaacagaaattttgat gctggttggtgtgaaattggagcctctcgaaacccctgtgatgaaacttactgtggacct gccgcagagtctgaaaaggagaccaaggccctggctgatttcatccgcaacaaactctct tccatcaaggcatatctgacaatccactcgtactcccaaatgatgatctacccttactca tatgcttacaaactcggtgagaacaatgctgagttgatgacatttggcaatgtttgggag acatctttggttgtcataactgtcgagctctactggcatctagcagtatgtcagtggagt tttaccacagagtttctgagcgcagagagcctttccctaggagtggggtgcagggaactg gagggcaatggttcttatgagaaggtgatgatgcttctaagaaaaaagattacacatcct gctgctgggggctctgacgactgggcttatgaccaaggaatcagatattccttcaccttt gaacttcgagatacaggcagatatggctttctccttccagaatcccagatccgggctacc tgcgaggagaccttcctggcaatcaagtatgttgccagctacgtcctggaacacctgtac tag >gi568815595f:148727824_148959999|GENSCAN_predicted_peptide_6|417_aa MRLILPVGLIATTLAIAPVRFDREKVFRVKPQDEKQADIIKDLAKTNELDFWYPGATHHV AANMMVDFRVSEKESQAIQSALDQNKMHYEILIHDLQEEIEKQFDVKEDIPGRHSYAKYN NWEKIVAWTEKMMDKYPEMVSRIKIGSTVEDNPLYVLKIGEKNERRKAIFTDCGIHAREW VSPAFCQWFVYQATKTYGRNKIMTKLLDRMNFYILPVFNVDGYIWSWTKNRMWRKNRSKN QNSKCIGTDLNRNFNASWNSIPNTNDPCADNYRGSAPESEKETKAVTNFIRSHLNEIKVY ITFHSYSQMLLFPYGYTSKLPPNHEDLAKVAKIGTDVLSTRYETRYIYGPIESTIYPISG SSLDWAYDLGIKHTFAFELRDKGKFGFLLPESRIKPTCRETMLAVKFIAKYILKHTS >gi568815595f:148727824_148959999|GENSCAN_predicted_CDS_6|1254_bp atgaggctcatcctgcctgtgggtttgattgctaccactcttgcaattgctcctgtccgc tttgacagggagaaggtgttccgcgtgaagccccaggatgaaaaacaagcagacatcata aaggacttggccaaaaccaatgagcttgacttctggtatccaggtgccacccaccacgta gctgctaatatgatggtggatttccgagttagtgagaaggaatcccaagccatccagtct gccttggatcaaaataaaatgcactatgaaatcttgattcatgatctacaagaagagatt gagaaacagtttgatgttaaagaagatatcccaggcaggcacagctacgcaaaatacaat aattgggaaaagattgtggcttggactgaaaagatgatggataagtatcctgaaatggtc tctcgtattaaaattggatctactgttgaagataatccactatatgttctgaagattggg gaaaagaatgaaagaagaaaggctatttttacggattgtggcattcacgcacgagaatgg gtctccccagcattctgccagtggtttgtctatcaggcaaccaaaacttatgggagaaac aaaattatgaccaaactcttggaccgaatgaatttttacattcttcctgtgttcaatgtt gatggatatatttggtcatggacaaagaaccgcatgtggagaaaaaatcgttccaagaac caaaactccaaatgcatcggcactgacctcaacaggaattttaatgcttcatggaactcc attcctaacaccaatgacccatgtgcagataactatcggggctctgcaccagagtccgag aaagagacgaaagctgtcactaatttcattagaagccacctgaatgaaatcaaggtttac atcaccttccattcctactcccagatgctattgtttccctatggatatacatcaaaactg ccacctaaccatgaggacttggccaaagttgcaaagattggcactgatgttctatcaact cgatatgaaacccgctacatctatggcccaatagaatcaacaatttacccgatatcaggt tcttctttagactgggcttatgacctgggcatcaaacacacatttgcctttgagctccga gataaaggcaaatttggttttctccttccagaatcccggataaagccaacgtgcagagag accatgctagctgtcaaatttattgccaagtatatcctcaagcatacttcctaa >gi568815595f:148727824_148959999|GENSCAN_predicted_peptide_7|221_aa MAPLSAGPPKKAGQPYPTGQGQQFPKQVPKHATQWLGPECIDHSTSSCPRELGTRQGCLL SPLLFYIVLEGLASAIMQEKEIKACKSEKKQPLLQLPIAGALAFLGMILQKQKLSRTATA LYQASLRETCLIPQVCGKTRLLLEMEKGLWGGQDASCQSNKVPVKSCRPQRQQIITVSCQ HFPHILQKQPLDPKHNSSFDSFDDGATAKCCGNQNCLKTAD >gi568815595f:148727824_148959999|GENSCAN_predicted_CDS_7|666_bp atggctccactgagtgcaggtccacccaagaaagctggtcagccttatccaacagggcaa ggacagcaatttccgaaacaagttcccaaacacgcgacacaatggcttggaccagaatgc atagaccatagtacatccagctgccccagggaactgggaacaaggcaaggatgtctgctc tcaccactcttattctacatagtgctggaaggtctagccagtgcaataatgcaagaaaaa gaaataaaagcatgcaaatcagaaaagaagcagcctcttctccagcttcccatagcgggt gctctagcatttttggggatgattctccaaaaacaaaaactttccagaactgcaactgct ctataccaggcctctcttagggagacatgtcttatccctcaggtgtgcggaaaaacacgg cttctgctggagatggaaaaaggactctggggaggacaagatgcctcatgtcaaagcaac aaggttccagtgaagagctgcaggcctcagaggcagcaaatcatcacagtgagctgtcag cattttccccacattctccagaagcagcctctggatcctaaacacaattcttcctttgat tcttttgatgacggtgcaacagcaaaatgctgtggcaatcaaaactgcctgaaaacagca gattag >gi568815595f:148727824_148959999|GENSCAN_predicted_peptide_8|37_aa XSRVARSRISVKHNAQNGSEEQPVREAARLTGQGEGS >gi568815595f:148727824_148959999|GENSCAN_predicted_CDS_8|114_bp nngagcagagttgctagatcaaggatatcagttaaacataatgctcaaaatggcagtgag gagcaaccagtgagggaagcggcaagactgacgggacagggtgaaggctcctga