GENSCAN 1.0    Date run: 3-Nov-116    Time: 22:33:40

Sequence gi568815589f:131189998_131408284 : 218287 bp : 50.01% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   7219   9018 1800  2  0   88  110   617 0.858  51.22
 1.02 Intr +  11650  11720   71  2  2  107   99    54 0.987   7.30
 1.03 Intr +  25215  25371  157  2  1   90  108    40 0.903   5.78
 1.04 Intr +  32781  32933  153  1  0  124   78    55 0.957   8.14
 1.05 Intr +  38163  38334  172  1  1   96  102    95 0.933  10.70
 1.06 Intr +  40633  40772  140  1  2   42   99    52 0.684   1.81
 1.07 Intr +  42287  42311   25  0  1  110   99     8 0.523   1.18
 1.08 Intr +  42424  42563  140  1  2   96   55    20 0.550  -0.39
 1.09 Intr +  43730  43918  189  0  0   40   68    95 0.019   2.26
 1.10 Intr +  45966  46043   78  1  0   71   75    36 0.019   0.12
 1.11 Intr +  49894  50023  130  2  1   89   33    78 0.002   2.15
 1.12 Intr +  62309  62490  182  2  2   99   36   200 0.954  15.41
 1.13 Term +  62698  62834  137  2  2   90   49   119 0.985   6.38
 1.14 PlyA +  63902  63907    6                                1.05

 2.07 PlyA -  68102  68097    6                                1.05
 2.06 Term -  71353  70825  529  0  1  112   48  1233 0.166 115.43
 2.05 Intr -  86138  85860  279  1  0   -4   84   469 0.187  33.89
 2.04 Intr -  86946  86693  254  0  2   96   33   154 0.192   6.93
 2.03 Intr -  87229  87154   76  0  1   45   66    88 0.328   1.82
 2.02 Intr -  90423  90247  177  2  0  102   56   107 0.341   7.93
 2.01 Init -  91915  91875   41  1  2   50   76    30 0.183  -2.32
 2.00 Prom -  96944  96905   40                               -5.36

 3.00 Prom +  99689  99728   40                               -4.36
 3.01 Init + 100001 100451  451  1  1   93   92   757 0.996  72.68
 3.02 Intr + 105679 105739   61  2  1  113  108   -21 0.084   0.19
 3.03 Intr + 111434 111579  146  2  2   54   48   102 0.109   2.73
 3.04 Intr + 117926 118286  361  0  1   93   32   689 0.743  58.08
 3.05 Term + 121096 121270  175  1  1   41   49   143 0.676   2.93
 3.06 PlyA + 132274 132279    6                                1.05

 4.02 PlyA - 132298 132293    6                                1.05
 4.01 Sngl - 150770 150435  336  2  0   55   43   239 0.685  12.13
 4.00 Prom - 153089 153050   40                               -4.96

 5.06 PlyA - 153257 153252    6                                1.05
 5.05 Term - 154251 154008  244  2  1  -28   54   213 0.406   1.67
 5.04 Intr - 154430 154351   80  2  2   75   71    62 0.020   1.55
 5.03 Intr - 163269 163132  138  0  0   72   37   149 0.009   8.86
 5.02 Intr - 180733 180682   52  0  1   77   79    40 0.070   0.91
 5.01 Init - 183444 183221  224  0  2   53   72   177 0.887   8.99
 5.00 Prom - 187444 187405   40                               -3.36

 6.04 PlyA - 187491 187486    6                                1.05
 6.03 Term - 191222 191140   83  0  2   28   51   128 0.436   0.96
 6.02 Intr - 208649 208343  307  2  1    6   42   223 0.004   5.52
 6.01 Init - 218043 217969   75  0  0   78   93    93 0.168   9.89

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  43457  43644  188  0  2  151   38    67 0.836   5.65
S.002 Term -  45748  45684   65  2  2   89   49    67 0.908   0.95
S.003 Intr -  46236  46161   76  0  1   51   80    72 0.894   1.79
S.004 Term -  98051  97881  171  2  0  120   43   139 0.954  10.43
S.005 Term - 163269 163060  210  0  0   72   39   187 0.819   9.49

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589f:131189998_131408284|GENSCAN_predicted_peptide_1|1124_aa XATPSTKESSQPDAFSSGGGSKPSYEAIPESSPPSGITSASNTTPGEPAASSSRPVAPSG TALSTTSSKLETPPSKLGELLFPSSLAGETLGSFSGLRVGQADDSTKPTNKASSTSLTST QPTKTSGVPSGFNFTAPPVLGKHTEPPVTSSATTTSVAPPAATSTSSTAVFGSLPVTSAG SSGVISFGGTSLSAGKTSFSFGSQQTNSTVPPSAPPPTTAATPLPTSFPTLSFGSLLSSA TTPSLPMSAGRSTEEATSSALPEKPGDSEVSASAASLLEEQQSAQLPQAPPQTSDSVKKE PVLAQPAVSNSGTAASSTSLVALSAEATPATTGVPDARTEAVPPASSFSVPGQTAVTAAA ISSAGPVAVETSSTPIASSTTSIVAPGPSAEAAAFGTVTSGSSVFAQPPAASSSSAFNQL TNNTATAPSATPVFGQVAASTAPSLFGQQTGSTASTAAATPQVSSSGFSSPAFGTTAPGV FGQTTFGQASVFGQSASSAASVFSFSQPGFSSVPAFGQPASSTPTSTSGSVFGAASSTSS SSSFSFGQSSPNTGGGLFGQSNAPAFGQSPGFGQGGSVFGGTSAATTTAATSGFSFCQAS GFGSSNTGSVFGQAASTGGIVFGQQSSSSSGSVFGSGNTGRGGGFFSGLGGKPSQDAANK NPFSSASGGFGSTATSNTSNLFGNSGAKTFGGFASSSFGEQKPTGTFSSGGGSVASQGFG FSSPNKTGGFGAAPVFGSPPTFGGSPGFGGVPAFGSAPAFTSPLGSTGGKVFGEGTAAAS AGGFGFGSSSNTTSFGTLASQNAPTFGSLSQQTSGFGTQSSGFSGFGSGTGGFSFGSNNS VCPGSVGVQQQGLGFVDLLFSVAIWQEVPGLAFLRGVVQREKSTPANHSQEDCPVSVALR SQQSLHQPEHCVFKDGIRSHQRRVLINSIECAVVSLGPSALHIGKRLNADTHITLVIGAS
DGNIGIEAKLSSSGTGRDPGPSRRTPVPHEVFPDPTADRQHSLQEWGSGTRKGPAWVTEP SCKRKHRPGEPGDPGPPHRRGAGTRPLEELLAFTVSAWSLVNGRLPSGKLLLRDSDPGWK AVGPAFRPPSHDRTARRRFCEPGGGPHHAETVGAFILDFQPPEL >gi568815589f:131189998_131408284|GENSCAN_predicted_CDS_1|3375_bp ngggcaacaccctccactaaagagtcaagccagccggacgcattctcatctggtggggga agcaaaccttcttatgaggccattcctgaaagctcacctccctcaggaatcacatccgca tcaaacaccaccccaggagaacctgccgcatctagcagcagacctgtggcaccttctgga actgctctttccaccacctctagtaagctggaaaccccaccgtccaagctgggagagctt ctgtttccaagttctttggctggagagactctgggaagtttttcaggactgcgggttggc caagcagatgattctacaaaaccaaccaataaggcttcatccacaagcctaactagtacc cagccaaccaagacgtcaggcgtgccctcagggtttaattttactgcccccccggtgtta gggaagcacacggagccccctgtgacatcctctgcaaccaccacctcagtagcaccacca gcagccaccagcacttcctcaactgccgtttttggcagtctgccagtcaccagtgcagga tcctctggggtcatcagttttggtgggacatctctaagtgctggcaagactagtttttca tttggaagccaacagaccaatagcacagtgcccccatctgccccaccaccaactacagct gccactccccttccaacatcattccccacattgtcatttggtagcctcctgagttcagca actaccccctccctgcctatgtccgctggcagaagcacagaagaggccacttcatcagct ttgcctgagaagccaggtgacagtgaggtctcagcatcagcagcctcacttctagaggag caacagtcagcccagcttccccaggctcctccgcaaacttctgactctgttaaaaaagaa cctgttcttgcccagcctgcagtcagcaactctggcactgcagcatctagtactagtctt gtagcactttctgcagaggctaccccagccaccacgggggtccctgatgccaggacggag gcagtaccacctgcttcctccttttctgtgcctgggcagactgctgtcacagcagctgct atctcaagtgcaggccctgtggccgtcgaaacatcaagtacccccatagcctccagcacc acgtccattgttgctcccggcccatctgcagaggcagcagcatttggtaccgtcacttct ggctcatccgtctttgctcagcctcctgctgccagttctagctcagctttcaaccagctc accaacaacacagccactgccccctctgccacgcccgtgtttgggcaagtggcagccagc accgcaccaagtctgtttgggcagcagactggtagcacagccagcacagcagctgccaca ccacaggtcagcagctcagggtttagcagcccagcttttggtaccacagccccaggggtc tttggacagacaaccttcgggcaggcctcagtctttgggcagtcggcgagcagtgctgca agtgtcttttccttcagtcagcctgggttcagttccgtgcctgccttcggtcagcctgct tcctccactcccacatccaccagtggaagtgtctttggtgccgcctcaagtaccagtagc tccagttccttctcatttggacagtcttctcccaacacaggaggggggctgtttggccaa 
agcaacgctcctgcttttgggcagagtcctggctttggacagggaggctctgtctttggt ggtacctcagctgccaccacaacagcagcaacctctgggttcagcttttgccaagcttca ggttttgggtctagtaatactggttctgtgtttggtcaagcagccagtactggtggaata gtctttggccagcaatcatcctcttccagtggtagcgtgtttgggtctggaaacactgga agagggggaggtttcttcagtggccttggaggaaaacccagtcaggatgcagccaacaaa aacccattcagctcggccagtgggggctttggatccacagctacctcaaatacctctaac ctatttggaaacagtggggccaagacatttggtggatttgccagctcgtcgtttggagag cagaaacccactggcactttcagctctggaggaggaagtgtggcatcccaaggctttggg ttttcctctccaaacaaaacaggtggcttcggtgctgctccagtgtttggcagccctcct acttttgggggatcccctgggtttggaggggtgccagcattcggttcagccccagccttt acaagccctctgggctcgacgggaggcaaagtgttcggagagggcactgcagctgccagc gcaggaggattcgggtttgggagcagcagcaacaccacatccttcggcacgctcgcgagt cagaatgcccccactttcggatcactgtcccaacagacttctggttttgggacccagagt agcggattctctggttttggatcaggcacaggagggttcagctttgggtcaaataactca gtttgtcctggaagtgtgggggttcagcagcagggtttgggttttgtggacttgctcttc tctgtagcaatatggcaggaggtgccaggcctcgccttcttaagaggcgtggttcaaaga gaaaagagcacgcctgccaatcacagccaagaagattgccccgtttctgtggctctgaga agccagcagagcctccatcagccagaacactgtgtcttcaaggatggcatcagaagtcac cagcgtcgggtgttgataaacagcatcgaatgtgccgtggtctcacttggacctagcgcc ctccacatagggaagaggttgaatgctgacacacatatcacactggtaattggtgcttca gatggcaacatagggatagaggccaaattgagctcctcagggacagggagagacccaggc cccagcagacggaccccagtgccccacgaagtcttccctgatcccacagctgaccggcag cactcacttcaggaatggggctcggggacccgcaaaggtccagcctgggtgacagagccc agctgcaagaggaagcaccgtcctggggagcccggagacccaggcccccctcacaggcga ggggctggcacgcggcctctggaggaacttctggccttcaccgtctccgcctggtcactc gtcaatggccggctgccttctggaaagctgctgctgcgcgacagtgaccctgggtggaag gctgtgggacctgccttccggcctccgtcacatgaccgcacagcgaggaggcgcttctgt gaaccaggaggtggccctcaccacgcagaaactgtgggcgccttcatcttggacttccag cctccagaactgtga >gi568815589f:131189998_131408284|GENSCAN_predicted_peptide_2|451_aa MAPGIIPDISGKGIHRHFTRAWMRHEPSTYAEAAVGLCPTPTELRKMTSRRKGCVGKRQG RRGRELGKPPEPCGSRSPPTPPVRDGISGDVTGGYYSLVLGGRGEAGGGGRSAAPGPCTA AGSDSGEGTRRPCVGTSGPRPRVGADGDPRRAPPSPASGRSPELASWKPRVGARPEGAGT 
SAGALLYAMGCIQSIGGKARVFREGITVIDVKASIDPVPTSIDESSSVVLRYRTPHFRAS AQVVMPPIPKKETWVVGWIQACSHMEFYNQYGEQGMSSWELPDLQEGKIQAISDSDGVNY PWYGNTTETCTIVGPTKRDSKFIISMNDNFYPSVTWAVPVSESNVAKLTNIYRDQSFTTW LVATNTSTNDMIILQTLHWRMQLSIEVNPNRPLGQRARLREPIAQDQPKILSKNEPIPPS ALVKPNANDAQVLMWRPKYGQPLVVIPPKHR >gi568815589f:131189998_131408284|GENSCAN_predicted_CDS_2|1356_bp atggctcctggcattatccccgacatctcagggaaggggatacaccggcacttcacacgc gcgtggatgcgtcatgagccttccacatatgctgaggctgccgtgggtctgtgcccaaca cccacggagctccggaaaatgacgagcaggaggaagggctgtgtagggaagcggcagggc cggaggggccgcgagctggggaagccaccagaaccctgcggcagccgctcgccccccacc ccgcccgtccgggacggtatcagcggagatgtcacgggcggctattattcgctggtcctc ggcggccgcggcgaagcaggcggcggcggccggagcgcagccccgggaccctgcacggcg gccggcagcgacagcggcgaagggacgcggcgcccctgcgtggggacgtccggcccgcgc ccgcgagtgggcgccgacggggacccgcgccgcgctcccccgtcaccggcgagcggccgg agccctgagctcgcctcctggaagccgcgggtcggcgctcgccccgagggcgccgggacc tcggccggagcgctcctgtatgccatgggctgtattcagagcatcggaggcaaagccaga gtcttccgggaagggatcacggtgattgatgtgaaagcctccatcgaccccgtccccact agcatcgatgagtcctccagcgtggtgctccgctaccggacaccccacttccgggcctcg gcccaggtggtcatgccgcccatccccaagaaggagacttgggtagttggctggatccag gcgtgcagccacatggagttctacaaccagtacggcgagcagggcatgtccagctgggag ctccccgacctccaggagggcaagatccaagccatcagcgactcggatggggtgaactac ccctggtacggcaacaccacagagacctgcaccatcgtgggccccaccaagagggactcc aagttcatcatcagcatgaatgacaacttttaccccagcgtcacatgggccgtgcccgtc agcgagagcaacgtggccaagctcaccaatatctaccgggaccagagcttcaccacctgg ctggtggccaccaacacctccaccaacgacatgatcatcctgcagacgctgcactggcgc atgcagctcagcatcgaggtgaaccccaaccggcccctgggccagcgcgcccggctgcgg gagcccatcgcccaggaccagcccaaaatcctgagcaagaatgagcccatcccgcccagc gccctggtcaagcccaatgccaacgatgcccaggtcctcatgtggcggcccaagtacggg cagccgctggtggtgatcccgcccaagcaccggtga >gi568815589f:131189998_131408284|GENSCAN_predicted_peptide_3|397_aa MPASQSRARARDRNNVLNRAEFLSLNQPPKGGPEPRSSGRKASGPSAQPPPAGDGARERR QSQQLPEEDCMQLNPSFKGIAFNSLLAIDICMSKRLGVCAGRAASWASARSMVKLIGITG HGIPWIGGTILCLVKSSTLAGQEVLMNLLLAPGNLHSTLCLCGFDCSRDLKFWQLFGHPD 
PGWPPVTNHHHQEALLGLASLTCLEVVEVLEPGPDNFDPALLLDIMTVAGVQKLIKRRGP YETSPSLLDYLTMDIYAFPAGHASRAAMVSKFFLSHLVLAVPLRVLLVLWALCVGLSRVM IGRHHVTDVLSGFVIGYLQFRLVELVWMPSSTCQMLISAWAPGVVAPGGKEGTGTRCADM AVSTKQTRDLTLPGFPWTSSLSDTDVSLEPSLCIPFP >gi568815589f:131189998_131408284|GENSCAN_predicted_CDS_3|1194_bp atgccagcttcccagagccgggcccgtgcccgggaccgcaacaacgtcctcaaccgggct gagttcctgtccctgaaccagccccccaaggggggcccggagccccgcagctcgggcaga aaggcctcgggcccatcagcacagcccccacctgctggtgacggggccagagagcgacgc cagtcacagcagctgccagaggaggactgcatgcagctgaacccctccttcaagggcatc gccttcaactccctgctggccatcgatatctgtatgtccaagcggctgggggtgtgcgct ggccgggcggcgtcctgggccagtgcccgctccatggtcaagctcatcggcatcacgggc cacggcatcccctggatcggaggcaccatcctctgcctggtgaagagcagcacactggcc ggccaggaggtgctcatgaatctgctcctggcccctggcaacctccattctacactctgt ctctgtggatttgactgttctagggacctcaaattctggcagctgttcgggcacccagac cctggctggccaccagtcaccaaccaccaccaccaggaggccttgcttggcctggcctct cttacttgtcttgaggtggtggaggtcttggaacctgggccagacaattttgaccctgcc ctgctcctggacatcatgacggtggccggcgtgcagaagctcatcaagcggcgcggcccg tacgagacgagccccagcctcctggactacctcaccatggacatctacgccttcccggcc gggcacgccagccgcgccgccatggtgtccaagttcttcctcagccacctggtgctggcg gtgcccctgcgtgtgctgctggtgctctgggccctctgcgtgggcctgtcccgcgtgatg atcggccgccaccacgtcacggacgtcctctccggctttgtcatcggctacctccagttc cgtctggtggagctggtctggatgccctccagcacctgccagatgctcatctctgcctgg gctcctggcgttgttgcccctggtggcaaggaaggaactgggacacgatgtgctgacatg gccgttagcacaaagcagacccgggatctgactcttccaggctttccctggacttcatca ctctcagacaccgacgtgtctctagaaccatcactctgcatcccctttccatga >gi568815589f:131189998_131408284|GENSCAN_predicted_peptide_4|111_aa MKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINMVKTAILPKVIYRFNAIPVKLPMTF FTELETTTLKFIWNQKRARIAKSILSKKNKAEGIMLPDFEPYYKATVTKTA >gi568815589f:131189998_131408284|GENSCAN_predicted_CDS_4|336_bp atgaaggacctcttcaaggagaactacaaaccactgctcaacgaaataaaagaggacaca aacaaatggaagaacattccatgctcatggataggaagaatcaatatggtgaaaacggcc atactgcccaaggtaatttatagatttaatgccatccctgtcaagctaccaatgactttc ttcacagaattggaaacaactactttaaagttcatatggaaccagaaaagagcccgcatc 
gccaagtcaatcctaagcaaaaagaacaaagctgaaggtatcatgctacctgactttgaa ccatactacaaggctacagtaaccaaaacagcatga >gi568815589f:131189998_131408284|GENSCAN_predicted_peptide_5|245_aa MLAVDSRRVPAPAAVLRVLGSARRSSAATVRLEGGGPAEAGWWPLPRERLSKMLTGALAR EIERKSDAVVCVTPQCFTCINSLNPYDNPLQQVYAHRIESRDSNRNSYANIHSEVIHNSQ KVGTIETSLSRGMDKHDVDRSSSPAMEQSWTGNDFDKLTEVGFRRINSIEKTLNDLMELK TMARELRDACTSFSSRFDQVQERVSVIEDHMNEMKREEKFREKRTKPPRNMGPGEKPKST FDWCT >gi568815589f:131189998_131408284|GENSCAN_predicted_CDS_5|738_bp atgctagcagttgactcccgcagggtcccggcgccggcagccgtgctgcgtgtgttgggt agtgcgaggcggtccagtgcagcgacagtccggctcgagggcggagggccagccgaagcg gggtggtggccgctgccccgggaaagactgtccaagatgctgacaggagctttggctcgt gagattgagaggaagtcagacgccgtggtctgtgtaaccccccagtgctttacatgcatc aattcattaaatccttacgacaaccctctgcagcaggtctatgcccacaggattgaaagc agggactcaaacaggaactcgtacgccaacattcatagcgaggttattcacaacagccaa aaggtgggaacaattgaaacatccctcagcagaggaatggacaagcacgatgtggatcgc agctcctcgccagcaatggaacaaagctggacagggaatgactttgacaagttgacagaa gtaggcttcagaagaataaacagcatagagaagaccttaaatgacctgatggagctgaaa accatggcacgagaactacgtgacgcatgcacaagcttcagtagccgatttgatcaagtg caagaaagggtatcagtgattgaagatcacatgaatgaaatgaagcgagaagagaagttt agagaaaaaagaacaaagcctccaagaaatatgggaccaggtgaaaagcccaaatctacc tttgattggtgtacctga >gi568815589f:131189998_131408284|GENSCAN_predicted_peptide_6|154_aa MTITRSTEGLLSNHSNLDAMPPVTPVLKEYLLPDKSIKNAINTAAAAPQRKRLMVILERR KTVQTEATQLAAEVRTEGSPVQDVKDRQQLEIGFVLPATPAARNASQISAAASTSEVHFS SDEGCFSVLFLLNRLRIFLNMETQEDTTTSEADM >gi568815589f:131189998_131408284|GENSCAN_predicted_CDS_6|465_bp atgactatcaccagaagcaccgaaggactcctcagcaaccactccaacctggatgcaatg cctcccgttaccccggtgctcaaggagtatctattgcctgacaagtcaattaaaaatgca ataaatactgctgctgcagccccacagagaaaaagactgatggttatcttggaaaggagg aagacggtccagacggaagccacacaactggctgcagaggtaagaacggaaggaagtcct gtacaagatgtgaaagaccgccagcaacttgagataggctttgttctgccagctactcct gctgcacgcaatgcttcacaaatcagtgctgctgccagcacctcggaggtgcacttcagc tcggacgaaggatgcttctcagtcctcttcctgctcaaccgcttgagaatattcctcaac atggagacccaggaagacacaacaactagtgaagctgacatgtga