GENSCAN 1.0	Date run: 2-Nov-116	Time: 21:59:45

Sequence gi568815589r:131160825_131376179 : 215355 bp : 48.72% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2167   2349  183  1  0   90   64   121 0.938   8.80
 1.02 Intr +   3046   3131   86  1  2   59   91    56 0.611   2.46
 1.03 Intr +  13231  13494  264  2  0   45  111   138 0.239   9.28
 1.04 Intr +  14636  14797  162  0  0  104   98    55 0.987   8.05
 1.05 Intr +  17487  17586  100  1  1   52   63    58 0.525  -1.13
 1.06 Intr +  26465  26540   76  2  1   93   93    72 0.627   7.72
 1.07 Intr +  28229  28307   79  1  1   77   82    20 0.644  -0.58
 1.08 Intr +  36392  38191 1800  0  0   88  110   617 0.861  51.22
 1.09 Intr +  40823  40893   71  0  2  107   99    54 0.987   7.30
 1.10 Intr +  54388  54544  157  0  1   90  108    40 0.903   5.78
 1.11 Intr +  61954  62106  153  2  0  124   78    55 0.957   8.14
 1.12 Intr +  67336  67507  172  2  1   96  102    95 0.933  10.70
 1.13 Intr +  69806  69945  140  2  2   42   99    52 0.684   1.81
 1.14 Intr +  71460  71484   25  1  1  110   99     8 0.523   1.18
 1.15 Intr +  71597  71736  140  2  2   96   55    20 0.550  -0.39
 1.16 Intr +  72903  73091  189  1  0   40   68    95 0.019   2.26
 1.17 Intr +  75139  75216   78  2  0   71   75    36 0.019   0.12
 1.18 Intr +  79067  79196  130  0  1   89   33    78 0.002   2.15
 1.19 Intr +  91482  91663  182  0  2   99   36   200 0.954  15.41
 1.20 Term +  91871  92007  137  0  2   90   49   119 0.985   6.38
 1.21 PlyA +  93075  93080    6                               1.05

 2.07 PlyA -  97275  97270    6                               1.05
 2.06 Term - 100526  99998  529  1  1  112   48  1233 0.166 115.43
 2.05 Intr - 115311 115033  279  2  0   -4   84   469 0.187  33.89
 2.04 Intr - 116119 115866  254  1  2   96   33   154 0.192   6.93
 2.03 Intr - 116402 116327   76  1  1   45   66    88 0.328   1.82
 2.02 Intr - 119596 119420  177  0  0  102   56   107 0.341   7.93
 2.01 Init - 121088 121048   41  2  2   50   76    30 0.183  -2.32
 2.00 Prom - 126117 126078   40                              -5.36

 3.00 Prom + 128862 128901   40                              -4.36
 3.01 Init + 129174 129624  451  2  1   93   92   757 0.996  72.68
 3.02 Intr + 134852 134912   61  0  1  113  108   -21 0.084   0.19
 3.03 Intr + 140607 140752  146  0  2   54   48   102 0.109   2.73
 3.04 Intr + 147099 147459  361  1  1   93   32   689 0.743  58.08
 3.05 Term + 150269 150443  175  2  1   41   49   143 0.676   2.93
 3.06 PlyA + 161447 161452    6                               1.05

 4.02 PlyA - 161471 161466    6                               1.05
 4.01 Sngl - 179943 179608  336  0  0   55   43   239 0.685  12.13
 4.00 Prom - 182262 182223   40                              -4.96

 5.06 PlyA - 182430 182425    6                               1.05
 5.05 Term - 183424 183181  244  0  1  -28   54   213 0.406   1.67
 5.04 Intr - 183603 183524   80  0  2   75   71    62 0.020   1.55
 5.03 Intr - 192442 192305  138  1  0   72   37   149 0.009   8.86
 5.02 Intr - 209906 209855   52  1  1   77   79    40 0.071   0.91
 5.01 Init - 212617 212394  224  1  2   53   72   177 0.884   8.99

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  72630  72817  188  1  2  151   38    67 0.836   5.65
S.002 Term -  74921  74857   65  0  2   89   49    67 0.908   0.95
S.003 Intr -  75409  75334   76  1  1   51   80    72 0.894   1.79
S.004 Term - 127224 127054  171  0  0  120   43   139 0.954  10.43
S.005 Term - 192442 192233  210  1  0   72   39   187 0.819   9.49

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589r:131160825_131376179|GENSCAN_predicted_peptide_1|1441_aa
XHLLVPERETLFNTLANNREIINQQRKRLNHLVDSLQQLRLYKQTSLWSLSSAVPSQSSI
HSFDSDLESLCNALLKTTIESHTKSLPKVPASLSRSAFLSQRYYEDLDEVSSTSSVSQSL
ESEDARTSCKDDEAVVQAPRHAPVVRTPSIQPSLLPHAAPFAKSHLVHGSSPGVMGTSVA
TSASKIIPQGADSTMLATKTVKHGAPSPSHPISAPQAAAAAALRRQMASQAPAVNTLTES
TLKNVPQVVNVQELKNNPATPSTAMGSSVPYSTAKTPHPVLTPVAANQAKQGSLINSLKP
SGPTPASGQLSSGDKASGATPSTKESSQPDAFSSGGGSKPSYEAIPESSPPSGITSASNT
TPGEPAASSSRPVAPSGTALSTTSSKLETPPSKLGELLFPSSLAGETLGSFSGLRVGQAD
DSTKPTNKASSTSLTSTQPTKTSGVPSGFNFTAPPVLGKHTEPPVTSSATTTSVAPPAAT
STSSTAVFGSLPVTSAGSSGVISFGGTSLSAGKTSFSFGSQQTNSTVPPSAPPPTTAATP
LPTSFPTLSFGSLLSSATTPSLPMSAGRSTEEATSSALPEKPGDSEVSASAASLLEEQQS
AQLPQAPPQTSDSVKKEPVLAQPAVSNSGTAASSTSLVALSAEATPATTGVPDARTEAVP
PASSFSVPGQTAVTAAAISSAGPVAVETSSTPIASSTTSIVAPGPSAEAAAFGTVTSGSS
VFAQPPAASSSSAFNQLTNNTATAPSATPVFGQVAASTAPSLFGQQTGSTASTAAATPQV
SSSGFSSPAFGTTAPGVFGQTTFGQASVFGQSASSAASVFSFSQPGFSSVPAFGQPASST
PTSTSGSVFGAASSTSSSSSFSFGQSSPNTGGGLFGQSNAPAFGQSPGFGQGGSVFGGTS
AATTTAATSGFSFCQASGFGSSNTGSVFGQAASTGGIVFGQQSSSSSGSVFGSGNTGRGG
GFFSGLGGKPSQDAANKNPFSSASGGFGSTATSNTSNLFGNSGAKTFGGFASSSFGEQKP
TGTFSSGGGSVASQGFGFSSPNKTGGFGAAPVFGSPPTFGGSPGFGGVPAFGSAPAFTSP
LGSTGGKVFGEGTAAASAGGFGFGSSSNTTSFGTLASQNAPTFGSLSQQTSGFGTQSSGF
SGFGSGTGGFSFGSNNSVCPGSVGVQQQGLGFVDLLFSVAIWQEVPGLAFLRGVVQREKS
TPANHSQEDCPVSVALRSQQSLHQPEHCVFKDGIRSHQRRVLINSIECAVVSLGPSALHI
GKRLNADTHITLVIGASDGNIGIEAKLSSSGTGRDPGPSRRTPVPHEVFPDPTADRQHSL
QEWGSGTRKGPAWVTEPSCKRKHRPGEPGDPGPPHRRGAGTRPLEELLAFTVSAWSLVNG
RLPSGKLLLRDSDPGWKAVGPAFRPPSHDRTARRRFCEPGGGPHHAETVGAFILDFQPPE
L

>gi568815589r:131160825_131376179|GENSCAN_predicted_CDS_1|4326_bp
nngcacctgcttgtgccagagcgagagacactgtttaacaccctagccaacaatcgggaa
atcatcaaccaacagaggaagaggctgaatcacctggtggatagtcttcagcagctccgc
ctttacaaacagacttccctgtggagcctgtcctcggctgttccttcccagagcagcatt
cacagttttgacagtgacctggaaagcctgtgcaatgctttgttgaaaaccaccatagaa
tctcacaccaaatccttgcccaaagtaccagccagcctgtctcgatcagcctttctgtct
cagagatattatgaagacttggatgaagtcagctcaacgtcatctgtctcccagtctctg
gagagtgaagatgcacggacgtcctgtaaagatgacgaggcagtggttcaggcccctcgg
cacgcccccgtggttcgcactccttccatccagcccagtctcttgccccatgcagcacct
tttgctaaatctcacctggttcatggttcttcacctggtgtgatgggaacttcagtggct
acatctgctagcaaaattattcctcaaggggccgatagcacaatgcttgccacgaaaacc
gtgaaacatggtgcacctagtccttcccaccccatctcagccccgcaggcagctgccgca
gcagcactcaggcggcagatggccagtcaggcaccagctgtaaacactttgactgaatca
acgttgaagaatgtccctcaagtggtaaatgtgcaggaattgaagaataaccctgcaacc
ccttctacagccatgggttcttcagtgccctactccacagccaaaacacctcacccagtg
ttgaccccagtggctgctaaccaagccaagcaggggtctctaataaattcccttaagcca
tctgggcctacaccagcatccggtcagttatcatctggtgacaaagcttcaggggcaaca
ccctccactaaagagtcaagccagccggacgcattctcatctggtgggggaagcaaacct
tcttatgaggccattcctgaaagctcacctccctcaggaatcacatccgcatcaaacacc
accccaggagaacctgccgcatctagcagcagacctgtggcaccttctggaactgctctt
tccaccacctctagtaagctggaaaccccaccgtccaagctgggagagcttctgtttcca
agttctttggctggagagactctgggaagtttttcaggactgcgggttggccaagcagat
gattctacaaaaccaaccaataaggcttcatccacaagcctaactagtacccagccaacc
aagacgtcaggcgtgccctcagggtttaattttactgcccccccggtgttagggaagcac
acggagccccctgtgacatcctctgcaaccaccacctcagtagcaccaccagcagccacc
agcacttcctcaactgccgtttttggcagtctgccagtcaccagtgcaggatcctctggg
gtcatcagttttggtgggacatctctaagtgctggcaagactagtttttcatttggaagc
caacagaccaatagcacagtgcccccatctgccccaccaccaactacagctgccactccc
cttccaacatcattccccacattgtcatttggtagcctcctgagttcagcaactaccccc
tccctgcctatgtccgctggcagaagcacagaagaggccacttcatcagctttgcctgag
aagccaggtgacagtgaggtctcagcatcagcagcctcacttctagaggagcaacagtca
gcccagcttccccaggctcctccgcaaacttctgactctgttaaaaaagaacctgttctt
gcccagcctgcagtcagcaactctggcactgcagcatctagtactagtcttgtagcactt
tctgcagaggctaccccagccaccacgggggtccctgatgccaggacggaggcagtacca
cctgcttcctccttttctgtgcctgggcagactgctgtcacagcagctgctatctcaagt
gcaggccctgtggccgtcgaaacatcaagtacccccatagcctccagcaccacgtccatt
gttgctcccggcccatctgcagaggcagcagcatttggtaccgtcacttctggctcatcc
gtctttgctcagcctcctgctgccagttctagctcagctttcaaccagctcaccaacaac
acagccactgccccctctgccacgcccgtgtttgggcaagtggcagccagcaccgcacca
agtctgtttgggcagcagactggtagcacagccagcacagcagctgccacaccacaggtc
agcagctcagggtttagcagcccagcttttggtaccacagccccaggggtctttggacag
acaaccttcgggcaggcctcagtctttgggcagtcggcgagcagtgctgcaagtgtcttt
tccttcagtcagcctgggttcagttccgtgcctgccttcggtcagcctgcttcctccact
cccacatccaccagtggaagtgtctttggtgccgcctcaagtaccagtagctccagttcc
ttctcatttggacagtcttctcccaacacaggaggggggctgtttggccaaagcaacgct
cctgcttttgggcagagtcctggctttggacagggaggctctgtctttggtggtacctca
gctgccaccacaacagcagcaacctctgggttcagcttttgccaagcttcaggttttggg
tctagtaatactggttctgtgtttggtcaagcagccagtactggtggaatagtctttggc
cagcaatcatcctcttccagtggtagcgtgtttgggtctggaaacactggaagaggggga
ggtttcttcagtggccttggaggaaaacccagtcaggatgcagccaacaaaaacccattc
agctcggccagtgggggctttggatccacagctacctcaaatacctctaacctatttgga
aacagtggggccaagacatttggtggatttgccagctcgtcgtttggagagcagaaaccc
actggcactttcagctctggaggaggaagtgtggcatcccaaggctttgggttttcctct
ccaaacaaaacaggtggcttcggtgctgctccagtgtttggcagccctcctacttttggg
ggatcccctgggtttggaggggtgccagcattcggttcagccccagcctttacaagccct
ctgggctcgacgggaggcaaagtgttcggagagggcactgcagctgccagcgcaggagga
ttcgggtttgggagcagcagcaacaccacatccttcggcacgctcgcgagtcagaatgcc
cccactttcggatcactgtcccaacagacttctggttttgggacccagagtagcggattc
tctggttttggatcaggcacaggagggttcagctttgggtcaaataactcagtttgtcct
ggaagtgtgggggttcagcagcagggtttgggttttgtggacttgctcttctctgtagca
atatggcaggaggtgccaggcctcgccttcttaagaggcgtggttcaaagagaaaagagc
acgcctgccaatcacagccaagaagattgccccgtttctgtggctctgagaagccagcag
agcctccatcagccagaacactgtgtcttcaaggatggcatcagaagtcaccagcgtcgg
gtgttgataaacagcatcgaatgtgccgtggtctcacttggacctagcgccctccacata
gggaagaggttgaatgctgacacacatatcacactggtaattggtgcttcagatggcaac
atagggatagaggccaaattgagctcctcagggacagggagagacccaggccccagcaga
cggaccccagtgccccacgaagtcttccctgatcccacagctgaccggcagcactcactt
caggaatggggctcggggacccgcaaaggtccagcctgggtgacagagcccagctgcaag
aggaagcaccgtcctggggagcccggagacccaggcccccctcacaggcgaggggctggc
acgcggcctctggaggaacttctggccttcaccgtctccgcctggtcactcgtcaatggc
cggctgccttctggaaagctgctgctgcgcgacagtgaccctgggtggaaggctgtggga
cctgccttccggcctccgtcacatgaccgcacagcgaggaggcgcttctgtgaaccagga
ggtggccctcaccacgcagaaactgtgggcgccttcatcttggacttccagcctccagaa
ctgtga

>gi568815589r:131160825_131376179|GENSCAN_predicted_peptide_2|451_aa
MAPGIIPDISGKGIHRHFTRAWMRHEPSTYAEAAVGLCPTPTELRKMTSRRKGCVGKRQG
RRGRELGKPPEPCGSRSPPTPPVRDGISGDVTGGYYSLVLGGRGEAGGGGRSAAPGPCTA
AGSDSGEGTRRPCVGTSGPRPRVGADGDPRRAPPSPASGRSPELASWKPRVGARPEGAGT
SAGALLYAMGCIQSIGGKARVFREGITVIDVKASIDPVPTSIDESSSVVLRYRTPHFRAS
AQVVMPPIPKKETWVVGWIQACSHMEFYNQYGEQGMSSWELPDLQEGKIQAISDSDGVNY
PWYGNTTETCTIVGPTKRDSKFIISMNDNFYPSVTWAVPVSESNVAKLTNIYRDQSFTTW
LVATNTSTNDMIILQTLHWRMQLSIEVNPNRPLGQRARLREPIAQDQPKILSKNEPIPPS
ALVKPNANDAQVLMWRPKYGQPLVVIPPKHR

>gi568815589r:131160825_131376179|GENSCAN_predicted_CDS_2|1356_bp
atggctcctggcattatccccgacatctcagggaaggggatacaccggcacttcacacgc
gcgtggatgcgtcatgagccttccacatatgctgaggctgccgtgggtctgtgcccaaca
cccacggagctccggaaaatgacgagcaggaggaagggctgtgtagggaagcggcagggc
cggaggggccgcgagctggggaagccaccagaaccctgcggcagccgctcgccccccacc
ccgcccgtccgggacggtatcagcggagatgtcacgggcggctattattcgctggtcctc
ggcggccgcggcgaagcaggcggcggcggccggagcgcagccccgggaccctgcacggcg
gccggcagcgacagcggcgaagggacgcggcgcccctgcgtggggacgtccggcccgcgc
ccgcgagtgggcgccgacggggacccgcgccgcgctcccccgtcaccggcgagcggccgg
agccctgagctcgcctcctggaagccgcgggtcggcgctcgccccgagggcgccgggacc
tcggccggagcgctcctgtatgccatgggctgtattcagagcatcggaggcaaagccaga
gtcttccgggaagggatcacggtgattgatgtgaaagcctccatcgaccccgtccccact
agcatcgatgagtcctccagcgtggtgctccgctaccggacaccccacttccgggcctcg
gcccaggtggtcatgccgcccatccccaagaaggagacttgggtagttggctggatccag
gcgtgcagccacatggagttctacaaccagtacggcgagcagggcatgtccagctgggag
ctccccgacctccaggagggcaagatccaagccatcagcgactcggatggggtgaactac
ccctggtacggcaacaccacagagacctgcaccatcgtgggccccaccaagagggactcc
aagttcatcatcagcatgaatgacaacttttaccccagcgtcacatgggccgtgcccgtc
agcgagagcaacgtggccaagctcaccaatatctaccgggaccagagcttcaccacctgg
ctggtggccaccaacacctccaccaacgacatgatcatcctgcagacgctgcactggcgc
atgcagctcagcatcgaggtgaaccccaaccggcccctgggccagcgcgcccggctgcgg
gagcccatcgcccaggaccagcccaaaatcctgagcaagaatgagcccatcccgcccagc
gccctggtcaagcccaatgccaacgatgcccaggtcctcatgtggcggcccaagtacggg
cagccgctggtggtgatcccgcccaagcaccggtga

>gi568815589r:131160825_131376179|GENSCAN_predicted_peptide_3|397_aa
MPASQSRARARDRNNVLNRAEFLSLNQPPKGGPEPRSSGRKASGPSAQPPPAGDGARERR
QSQQLPEEDCMQLNPSFKGIAFNSLLAIDICMSKRLGVCAGRAASWASARSMVKLIGITG
HGIPWIGGTILCLVKSSTLAGQEVLMNLLLAPGNLHSTLCLCGFDCSRDLKFWQLFGHPD
PGWPPVTNHHHQEALLGLASLTCLEVVEVLEPGPDNFDPALLLDIMTVAGVQKLIKRRGP
YETSPSLLDYLTMDIYAFPAGHASRAAMVSKFFLSHLVLAVPLRVLLVLWALCVGLSRVM
IGRHHVTDVLSGFVIGYLQFRLVELVWMPSSTCQMLISAWAPGVVAPGGKEGTGTRCADM
AVSTKQTRDLTLPGFPWTSSLSDTDVSLEPSLCIPFP

>gi568815589r:131160825_131376179|GENSCAN_predicted_CDS_3|1194_bp
atgccagcttcccagagccgggcccgtgcccgggaccgcaacaacgtcctcaaccgggct
gagttcctgtccctgaaccagccccccaaggggggcccggagccccgcagctcgggcaga
aaggcctcgggcccatcagcacagcccccacctgctggtgacggggccagagagcgacgc
cagtcacagcagctgccagaggaggactgcatgcagctgaacccctccttcaagggcatc
gccttcaactccctgctggccatcgatatctgtatgtccaagcggctgggggtgtgcgct
ggccgggcggcgtcctgggccagtgcccgctccatggtcaagctcatcggcatcacgggc
cacggcatcccctggatcggaggcaccatcctctgcctggtgaagagcagcacactggcc
ggccaggaggtgctcatgaatctgctcctggcccctggcaacctccattctacactctgt
ctctgtggatttgactgttctagggacctcaaattctggcagctgttcgggcacccagac
cctggctggccaccagtcaccaaccaccaccaccaggaggccttgcttggcctggcctct
cttacttgtcttgaggtggtggaggtcttggaacctgggccagacaattttgaccctgcc
ctgctcctggacatcatgacggtggccggcgtgcagaagctcatcaagcggcgcggcccg
tacgagacgagccccagcctcctggactacctcaccatggacatctacgccttcccggcc
gggcacgccagccgcgccgccatggtgtccaagttcttcctcagccacctggtgctggcg
gtgcccctgcgtgtgctgctggtgctctgggccctctgcgtgggcctgtcccgcgtgatg
atcggccgccaccacgtcacggacgtcctctccggctttgtcatcggctacctccagttc
cgtctggtggagctggtctggatgccctccagcacctgccagatgctcatctctgcctgg
gctcctggcgttgttgcccctggtggcaaggaaggaactgggacacgatgtgctgacatg
gccgttagcacaaagcagacccgggatctgactcttccaggctttccctggacttcatca
ctctcagacaccgacgtgtctctagaaccatcactctgcatcccctttccatga

>gi568815589r:131160825_131376179|GENSCAN_predicted_peptide_4|111_aa
MKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINMVKTAILPKVIYRFNAIPVKLPMTF
FTELETTTLKFIWNQKRARIAKSILSKKNKAEGIMLPDFEPYYKATVTKTA

>gi568815589r:131160825_131376179|GENSCAN_predicted_CDS_4|336_bp
atgaaggacctcttcaaggagaactacaaaccactgctcaacgaaataaaagaggacaca
aacaaatggaagaacattccatgctcatggataggaagaatcaatatggtgaaaacggcc
atactgcccaaggtaatttatagatttaatgccatccctgtcaagctaccaatgactttc
ttcacagaattggaaacaactactttaaagttcatatggaaccagaaaagagcccgcatc
gccaagtcaatcctaagcaaaaagaacaaagctgaaggtatcatgctacctgactttgaa
ccatactacaaggctacagtaaccaaaacagcatga

>gi568815589r:131160825_131376179|GENSCAN_predicted_peptide_5|245_aa
MLAVDSRRVPAPAAVLRVLGSARRSSAATVRLEGGGPAEAGWWPLPRERLSKMLTGALAR
EIERKSDAVVCVTPQCFTCINSLNPYDNPLQQVYAHRIESRDSNRNSYANIHSEVIHNSQ
KVGTIETSLSRGMDKHDVDRSSSPAMEQSWTGNDFDKLTEVGFRRINSIEKTLNDLMELK
TMARELRDACTSFSSRFDQVQERVSVIEDHMNEMKREEKFREKRTKPPRNMGPGEKPKST
FDWCT

>gi568815589r:131160825_131376179|GENSCAN_predicted_CDS_5|738_bp
atgctagcagttgactcccgcagggtcccggcgccggcagccgtgctgcgtgtgttgggt
agtgcgaggcggtccagtgcagcgacagtccggctcgagggcggagggccagccgaagcg
gggtggtggccgctgccccgggaaagactgtccaagatgctgacaggagctttggctcgt
gagattgagaggaagtcagacgccgtggtctgtgtaaccccccagtgctttacatgcatc
aattcattaaatccttacgacaaccctctgcagcaggtctatgcccacaggattgaaagc
agggactcaaacaggaactcgtacgccaacattcatagcgaggttattcacaacagccaa
aaggtgggaacaattgaaacatccctcagcagaggaatggacaagcacgatgtggatcgc
agctcctcgccagcaatggaacaaagctggacagggaatgactttgacaagttgacagaa
gtaggcttcagaagaataaacagcatagagaagaccttaaatgacctgatggagctgaaa
accatggcacgagaactacgtgacgcatgcacaagcttcagtagccgatttgatcaagtg
caagaaagggtatcagtgattgaagatcacatgaatgaaatgaagcgagaagagaagttt
agagaaaaaagaacaaagcctccaagaaatatgggaccaggtgaaaagcccaaatctacc
tttgattggtgtacctga