GENSCAN 1.0 Date run: 8-Nov-116 Time: 04:25:02 Sequence gi568815588f:123036051_123237728 : 201678 bp : 44.36% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4416 4622 207 2 0 97 87 85 0.991 8.57 1.02 Intr + 5159 5329 171 1 0 71 87 81 0.828 6.44 1.03 Intr + 6996 7121 126 2 0 83 98 4 0.771 1.78 1.04 Intr + 7408 7572 165 0 0 11 45 126 0.585 0.76 1.05 Intr + 8343 8435 93 2 0 54 111 38 0.808 2.86 1.06 Intr + 11159 11248 90 1 0 87 91 3 0.546 0.69 1.07 Intr + 14999 15136 138 1 0 74 83 29 0.687 1.66 1.08 Intr + 17011 17110 100 0 1 54 27 104 0.617 0.58 1.09 Term + 17645 17715 71 0 2 90 45 98 0.838 3.80 1.10 PlyA + 17804 17809 6 1.05 2.08 PlyA - 18502 18497 6 1.05 2.07 Term - 19121 19099 23 0 2 105 48 16 0.330 -2.23 2.06 Intr - 26975 26792 184 2 1 32 36 157 0.041 4.26 2.05 Intr - 37425 37192 234 0 0 -30 72 235 0.058 7.99 2.04 Intr - 49614 49483 132 0 0 -3 89 178 0.958 9.64 2.03 Intr - 53004 52852 153 0 0 53 119 52 0.803 5.07 2.02 Intr - 62926 62846 81 1 0 69 75 81 0.402 4.73 2.01 Init - 64809 64684 126 0 0 91 42 33 0.296 -0.81 2.00 Prom - 71016 70977 40 -1.56 3.03 PlyA - 71191 71186 6 1.05 3.02 Term - 76008 75860 149 1 2 54 38 117 0.539 1.36 3.01 Init - 82213 82033 181 1 1 83 0 238 0.349 11.85 3.00 Prom - 87404 87365 40 -4.26 4.00 Prom + 91523 91562 40 -5.86 4.01 Init + 98383 98965 583 0 1 54 64 310 0.839 20.55 4.02 Intr + 99900 100400 501 1 0 54 81 484 0.020 37.25 4.03 Term + 101008 101681 674 2 2 83 54 1200 0.986 110.02 4.04 PlyA + 104604 104609 6 1.05 5.00 Prom + 104682 104721 40 -5.46 5.01 Init + 111899 112011 113 1 2 44 58 141 0.121 4.43 5.02 Intr + 112328 112596 269 2 2 47 82 242 0.139 16.68 5.03 Term + 113520 114073 554 1 2 116 47 879 0.999 81.08 5.04 PlyA + 114603 114608 6 1.05 6.00 Prom + 115857 115896 40 -7.16 6.01 Init + 118697 119066 370 1 1 75 17 498 0.782 36.56 6.02 Intr + 121679 121830 152 0 2 106 102 29 0.970 5.98 6.03 Intr + 124357 124515 159 0 0 29 72 160 0.990 8.68 6.04 Intr + 126186 126363 178 2 1 101 103 103 0.999 12.59 6.05 Term + 126562 126782 221 2 2 70 48 174 0.540 8.80 6.06 PlyA + 127821 127826 6 1.05 7.03 PlyA - 127857 127852 6 1.05 7.02 Term - 135825 135482 344 1 2 32 49 308 0.603 15.97 7.01 Init - 167528 167411 118 2 1 54 64 103 0.723 4.89 7.00 Prom - 172803 172764 40 -1.26 8.03 PlyA - 174428 174423 6 1.05 8.02 Term - 186402 186272 131 1 2 56 48 62 0.162 -2.66 8.01 Init - 190111 189982 130 1 1 44 57 154 0.621 8.21 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 24252 24316 65 2 2 37 116 60 0.820 4.32 S.002 Intr - 91208 91080 129 2 0 64 103 83 0.861 8.29 S.003 Term + 99247 99404 158 2 2 112 44 45 0.891 0.60 S.004 Init + 100001 100400 400 1 1 104 81 557 0.967 51.43 S.005 Init + 112329 112596 268 2 1 83 82 233 0.855 19.50 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:123036051_123237728|GENSCAN_predicted_peptide_1|386_aa LMGIEVDPEYGGTGASFLSTVLVIEELAKVDASVAVFCEIQNTLINTLIRKHGTEEQKAT YLPQLTTEKVGSFCLSEAGAGSDSFALKTRADKEGDYYVLNGSKMWISSAEHAGLFLVMA NVDPTIGYKGITSFLVDRDTPGLHIGKPENKLGLRASSTCPLTFENVKLETCEPNGAAEK EVWCQCTKGHKMGVLLSESGNSQAIKREQQKIQIVSADAGELPVPEANILGQIGHGYKYA IGSLNEGRIGIAAQMLGLAQGCFDYTIPYIKERIQFGKRLFDFQGLQHQVAHVATQLEAA RLLTYNAARLLEAGKPFIKEASMAKYYASEIAGQTTSKCIEWMGGVGYTKDYPVEKYFRD AKIGTIYEGASNIQLNTIAKHIDAEY >gi568815588f:123036051_123237728|GENSCAN_predicted_CDS_1|1161_bp ttgatgggtattgaagttgacccagaatatggaggcacaggagcttcatttttatccact gtgctcgtgatagaggaattagccaaagttgatgcatctgtggctgtcttttgtgagatc cagaacacattaattaacacactgattagaaaacatggaacagaagaacaaaaggccacc tatttgcctcagctcactacagaaaaagtaggaagtttctgcctttcagaggctggagca ggtagtgactcatttgctttgaagaccagagctgataaagagggagattattatgtcctc aatggatcaaagatgtggatcagcagtgctgagcacgcagggctctttctggtgatggca aatgtagaccctaccattggatataagggaattacctccttcttagtagatcgtgatact ccgggccttcatatagggaaacctgaaaacaaattggggctcagagcttcttccacctgc ccgttaacattcgaaaatgtcaagcttgagacttgtgaaccaaatggggccgcagagaag gaagtctggtgtcagtgcactaaaggccacaagatgggggtgttgctcagtgagagcgga aacagccaggctattaaacgtgagcagcagaagatccagatagttagtgctgatgctgga gaactccctgttccagaagccaatatcttgggacaaattggacatggctataagtatgcc atagggagtctcaatgaaggtagaataggaattgctgcacagatgctgggactggcgcaa ggatgttttgactacactattccatatattaaagaaaggatacaatttggcaaaagacta tttgattttcagggcctccaacaccaagtggctcacgtggccacccagctggaagctgca agattactaacatacaatgctgctaggcttttagaagctggaaagccattcataaaagaa gcgtcaatggccaaatactatgcatcagagattgcaggacaaacaacgagtaaatgtatc gagtggatggggggagtaggctacaccaaagattaccctgtggagaaatacttccgagat gcaaagattggtacgatatatgaaggagcttccaacatccagttgaacaccattgcaaag catatcgatgcagaatactga >gi568815588f:123036051_123237728|GENSCAN_predicted_peptide_2|310_aa MTCRRMSGEGSFSLEGLSQNYSQKALETYSQDMPQRSSLSRQSGNDLSILHRHSLAEPRE THEDVKDSKHQGFSRWALVIFQQQHFPRVAPQSVQQDDGLCPLIVPEGPRPVSWHLKGAL TGLGAVKKSEDDIIEGDLAVSEIKEEGGCEPDEDPKRMCCQEAQVRQANSLSPPQTVHGT SRGQGSFLNPLLFVESTCSLGLRHRCGLHGPTEDVPESLDVAVTGECIYSGVFGALPFMD SKITKHDNEDGGMSRADSVDDQEGGRADNMDTLDKGMIHVLIGTEQNCARFHHATQNDVQ FKTCMSLSAA >gi568815588f:123036051_123237728|GENSCAN_predicted_CDS_2|933_bp atgacctgcaggagaatgagtggagagggctctttctctctcgaggggctttcacagaac tactcccagaaggccctggagacctactctcaggacatgcctcagcggtcaagtctttcc agacagagtggcaatgatcttagcatccttcaccggcacagccttgccgagcccagggag acgcatgaagacgtgaaggactccaagcatcaaggcttctctcggtgggccctggtcata ttccaacagcagcatttccccagagtggctccccagagtgttcaacaggacgatggcctg tgtcccctcattgttccagaaggccccaggcccgtctcatggcatctgaagggggctctg actgggttaggagcagtgaagaagagtgaggatgatatcatagagggtgacttggcagtt agtgagatcaaggaggagggtggctgcgagccagatgaggaccccaagaggatgtgctgc caggaagcacaggtaagacaagcaaacagcctgagtcccccacaaacagtgcatggaacc agcagaggccagggctccttcctcaaccccttgctcttcgtggagtccacctgctccctg ggcctcagacacagatgtggcctgcatgggcccacggaggacgttccagagtccctcgat gtggccgtcacaggggagtgcatctactcaggagtgtttggggcattacccttcatggat tcaaagatcaccaagcatgacaatgaagatggcgggatgtcaagagcagacagtgttgat gaccaagaagggggtagagcagacaacatggatacgctggacaaagggatgattcacgtt cttattgggacggagcagaattgtgcaagatttcatcatgctactcagaacgacgtgcaa tttaaaacttgtatgtctttatcagcagcgtga >gi568815588f:123036051_123237728|GENSCAN_predicted_peptide_3|109_aa MVSAVQLGACLPARAGGRLAAVLLPVQALGGGSGGEAVTDFECPDPSNTPFGACFRCSRI ARIVKLFDFSQPGTSGGVESASGKSAIGFLFREAKIHATQHNLSIFISI >gi568815588f:123036051_123237728|GENSCAN_predicted_CDS_3|330_bp atggtttcggccgtccagcttggggcctgcctgccagctcgggcgggtggccgacttgct gcagtgctgctcccagtccaggcactgggtggcggctctggtggtgaagcagtcactgac ttcgaatgtcccgatccatccaacactccgtttggggcgtgcttccggtgttcccgcata gctaggattgtcaagctctttgattttagccaaccaggtacaagtggaggtgttgaaagc gcctctggaaagtctgctataggatttctgttccgtgaagccaaaattcatgctacccag cacaaccttagtatcttcatctctatttaa >gi568815588f:123036051_123237728|GENSCAN_predicted_peptide_4|585_aa MSLKCLPHGLLVRTGELFLKPVEWNVLSLSRSAKVQCGSLRTSLPSRRAADAAACRGGGS YAAAPSVHLAPIKALVRQLQTPAGRRDFLGESAVVGVTDSELIREPWAPAARAEGCPSES SRRCREAVAPAAPPSDARVLAGSARIRTQAHPLLVPGEGSLFGFPGLKAKATRISESEDA VRGLTPGFPYYRLDVEFRLWSDFRPLACSPRRPPVPLPPSRGPGGEGTMPEPGPDAAGTA SAQPQPPPPPPPAPKESPFSIKNLLNGDHHRPPPKPQPPPRTLFAPASAAAAAAAAAAAA AKGALEGAAGFALSQVGDLAFPRFEIPAQRFALPAHYLERSPAWWYPYTLTPAGGHLPRP EASEKALLRDSSPASGTDRDSPEPLLKADPDHKELDSKSPDEIILEESDSEESKKEGEAA PGAAGASVGAAAATPGAEDWKKGAESPEKKPACRKKKTRTVFSRSQVFQLESTFDMKRYL SSSERAGLAASLHLTETQVKIWFQNRRNKWKRQLAAELEAANLSHAAAQRIVRVPILYHE NSAAEGAAAAAAGAPVPVSQPLLTFPHPVYYSHPVVSSVPLLRPV >gi568815588f:123036051_123237728|GENSCAN_predicted_CDS_4|1758_bp atgagtcttaaatgtctgcctcatgggcttctcgtgcgaaccggggagctgtttttaaag ccagtggagtggaatgtcctttcactaagccgttctgcaaaggtccagtgtggttctcta aggacttcgctgccgtcgagacgcgcagcggatgccgctgcttgccgcggcggaggaagc tacgccgctgcgccttcggttcaccttgcccccattaaagcgttagttagacagctccaa acgccagctggccggagagacttcctgggagaaagtgctgtcgtcggggtcacagactcg gagttgatccgcgagccttgggcaccggcggccagagcagaggggtgcccttccgagagc agccgtcggtgccgagaagctgttgccccagcagccccaccgagtgacgcgcgggttttg gccgggtcagcgcgcatacgcacacaggcacacccactcctggtcccgggggaaggctca ctcttcggttttccagggctaaaggccaaggccacgcggatttctgagtccgaagacgca gtgcgaggacttacacctggattcccttactatcggctggatgttgagttccgtttatgg tctgatttccggcctctcgcctgctcgccccgccgcccgcctgtcccgctccctccctcc cggggacccggaggagaggggaccatgccggaacccgggccggacgctgccggcaccgcc agcgcacagccccaaccgccgccgccccccccacccgctcccaaggagtccccgttctcc atcaagaacctgctcaacggagaccaccaccggccgccccctaagcctcagccgccccca cggacgctcttcgcgccagcctcggctgccgccgccgccgccgctgccgctgccgcggcg gccaagggggccctggagggcgccgcgggcttcgcgctctcgcaggtgggcgacctggct ttccctcgctttgagatcccggcgcagaggtttgccctgcccgcgcactacctggagcgc tccccagcctggtggtacccctacaccctgacccccgccggcggccacctcccgcgacct gaagcctcggagaaggccttgctgagagactcctcccccgcctccggcacagaccgcgac tctccggagccactgctcaaggccgaccccgatcacaaggagctggactccaagagcccg gacgagatcattctggaggagagcgactccgaggaaagcaaaaaggaaggcgaagcggcg ccaggcgcggccggggcgagcgtaggggcggcggcggccactccgggcgcagaagactgg aagaagggcgctgaaagtccagagaagaagccggcgtgccgcaagaagaagacgcgcaca gtcttctcgcgcagccaggtcttccagctcgagtccaccttcgacatgaagcgctatctg agcagctcggagcgagccggcctggccgcgtccctgcacctcaccgagacgcaggtcaag atctggttccagaaccgccgcaacaagtggaagcggcagctggcggcggagctggaggcg gccaacctgagccatgccgcggcgcagcgcatcgtgcgggtgcccatcctctaccacgag aactcggcggccgagggcgcggcggctgcagccgcgggggccccggtgccagtcagccag ccgctgctcaccttcccgcaccccgtctactactcgcacccggtggtctcttccgtgccg ctgctacggccggtctga >gi568815588f:123036051_123237728|GENSCAN_predicted_peptide_5|311_aa MPARPSARSGGAPVLCAWPERFEIFRGRSAALRLAAFGMGSKEDAGKGCPAAGGVSSFTI QSILGGGPSEAPREPVGWPARKRSLSVSSEEEEPDDGWKAPACFCPDQHGPKEQGPKHHP PIPFPCLGTPKGSGGSGPGGLERTPFLSPSHSDFKEEKERLLPAGSPSPGSERPRDGGAE RQAGAAKKKTRTVFSRSQVYQLESTFDMKRYLSSSERACLASSLQLTETQVKTWFQNRRN KWKRQLSAELEAANMAHASAQTLVSMPLVFRDSSLLRVPVPRSLAFPAPLYYPGSNLSAL PLYNLYNKLDY >gi568815588f:123036051_123237728|GENSCAN_predicted_CDS_5|936_bp atgcccgccaggccctcggcgcgctctgggggcgcaccggtgctgtgtgcctggccggag cgttttgaaattttccggggccgctcggcggcgctgcgattggccgcgttcgggatgggc agcaaagaagatgcgggcaaggggtgtccggcggccggtggcgtctccagcttcaccatc cagtccatcctgggcgggggcccctcggaggcaccgcgggagcccgtcggctggccagcc aggaagcgcagcctgtccgtgtcctcggaggaggaggagccggacgacggctggaaggcg cccgcctgcttctgcccagaccagcacggccctaaggagcagggccccaagcaccatccc cccatcccttttccttgcctgggtacccccaagggcagcggaggctcgggcccgggcggc ttggagcgcacgcctttcctctctccttcgcactcggactttaaagaagagaaagagagg ctcctgcccgcgggctcgccctcgccggggtccgagcggccgcgggacggcggcgctgag cggcaggccggcgcggccaagaagaagacgcgcaccgtcttttcgcgcagccaggtgtac cagctcgagtccaccttcgacatgaagcgctacctgagcagctcggagcgcgcctgcctc gcctccagcctgcagctcacggagacccaggtaaagacttggttccagaaccgccgcaac aagtggaagcggcagctctcggctgagctggaggcggccaacatggcgcacgcgtcggcg cagactctggtgagcatgccgctggtgttccgggacagttcgctgctgcgcgtgccggtg ccgcgctcgctcgcctttcccgcgccgctctactacccgggaagcaacctctcggcctta cctctctacaacctatacaacaagctcgactactga >gi568815588f:123036051_123237728|GENSCAN_predicted_peptide_6|359_aa MGRVRTLAGECSAQAQAQSLLAVVLSAPPSGGTPSARLSVRSPSPRDPWGLWAPVLQMTG SNEFKLNQPPEDGISSVKFSPNTSQFLLVSSWDTSVRLYDVPANSMRLKYQHTGAVLDCA FYVENLVGTHDAPIRCVEYCPEVNVMVTGSWDQTVKLWDPRTPCNAGTFSQPEKVYTLSV SGDRLIVGTAGRRVLVWDLRNMGYVQQRRESSLKYQTRCIRAFPNKQGYVLSSIEGRVAV EYLDPSPEVQKKKYAFKCHRLKENNIEQIYPVNAISFHNIHNTFATGGSDGFVNIWDPFN KKRLCQFHRYPTSIASLAFSNDGTTLAIASSYMYEMDDTEHPEDGIFIRQVTDAETKPK >gi568815588f:123036051_123237728|GENSCAN_predicted_CDS_6|1080_bp atggggcgagtccggaccttggcgggcgagtgctcggcgcaggcgcaagcgcagagtctc ctcgcggtcgtcctctcggcccctccctctggggggacccccagtgccaggctgtcagtg cgcagccccagcccgcgggacccctggggactctgggcgcctgttctgcagatgaccggt tctaacgagttcaagctgaaccagccacccgaggatggcatctcctccgtgaagttcagc cccaacacctcccagttcctgcttgtctcctcctgggacacgtccgtgcgtctctacgat gtgccggccaactccatgcggctcaagtaccagcacaccggcgccgtcctggactgcgcc ttctacgtagaaaatcttgttgggacccatgatgcccctatcagatgtgttgaatactgt ccagaagtgaatgtgatggtcactggaagttgggatcagacagttaaactgtgggatccc agaactccttgtaatgctgggaccttctctcagcctgaaaaggtatataccctctcagtg tctggagaccggctgattgtgggaacagcaggccgcagagtgttggtgtgggacttacgg aacatgggttacgtgcagcagcgcagggagtccagcctgaaataccagactcgctgcata cgagcgtttccaaacaagcagggttatgtattaagctctattgaaggccgagtggcagtt gagtatttggacccaagccctgaggtacagaagaagaagtatgccttcaaatgtcacaga ctaaaagaaaataatattgagcagatttacccagtcaatgccatttcttttcacaatatc cacaatacatttgccacaggtggttctgatggctttgtaaatatttgggatccatttaac aaaaagcgactgtgccaattccatcggtaccccacgagcatcgcatcacttgccttcagt aatgatgggactacgcttgcaatagcgtcatcatatatgtatgaaatggatgacacagaa catcctgaagatggtatcttcattcgccaagtgacagatgcagaaacaaaacccaagtga >gi568815588f:123036051_123237728|GENSCAN_predicted_peptide_7|153_aa MIPDSKTDSGPWMLGYLLPRALLICHSLGLQFVLSFRNGNDKRRNNGRAKKGHGHVQPIC CTNCAQCVPKDKAIKKFVIRNIVEAAAVRDISEAGIFDAYVLPKLYVKLHYCVSCAIHNK VVRNQSCEARKDRTPPPRFRPAGAAPPPLPKPI >gi568815588f:123036051_123237728|GENSCAN_predicted_CDS_7|462_bp atgatacctgattccaagaccgactctggtccctggatgctgggctacctccttcctagg gccctgctgatctgccactcactgggcttgcagtttgtgctgtccttccgtaatgggaat gacaaaagaaggaacaacggtcgtgccaaaaagggccatggccacgtgcagcctatttgc tgcactaactgtgcccaatgcgtgcccaaggacaaggccattaagaaattcgtcattcga aacatagtggaggccgcagcagtcagggacatttctgaagcgggcatcttcgatgcctat gtgcttcccaagctgtatgtgaagctacattactgtgtgagttgtgcaattcacaacaaa gtagtcaggaatcaatcttgtgaagcccgcaaggaccgaacacccccaccccgatttaga cctgcgggtgctgccccacctcccctaccaaagcccatataa >gi568815588f:123036051_123237728|GENSCAN_predicted_peptide_8|86_aa MWEALELPSDLKGFDQNADNDTDNEIQAEVVSDGDEELVNWSKAELQTVSEKDIVAVPAP MDIKWNRDKPIPLCSVQILNSLNHKK >gi568815588f:123036051_123237728|GENSCAN_predicted_CDS_8|261_bp atgtgggaagctttggaacttcctagtgacttaaaaggctttgaccaaaatgctgataat gatacggacaatgaaatccaggctgaggtggtctcagatggagatgaggaacttgtgaac tggagtaaagctgagctacagacagtgagtgaaaaagacattgtggctgttccagccccc atggacatcaagtggaacagagacaaaccaatcccactatgctctgtgcaaattcttaac tcactgaatcataagaaatga