GENSCAN 1.0	Date run: 5-Nov-116	Time: 00:55:30

Sequence gi568815597f:236383165_236584328 : 201164 bp : 44.07% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  12466  12496   31  0  1  118  103    40 0.844   8.26
 1.02 Intr +  22803  22994  192  1  0   83   80    29 0.032   1.06
 1.03 Intr +  44228  44286   59  0  2  110  110    39 0.055   6.90
 1.04 Intr +  68583  68788  206  2  2   76  110    30 0.010   2.00
 1.05 Intr +  85067  85112   46  2  1   91   99    53 0.085   5.11
 1.06 Intr +  86438  86483   46  1  1   66   91    35 0.084  -0.42
 1.07 Term +  99103  99485  383  2  2   93   43   313 0.977  22.30
 1.08 PlyA +  99705  99710    6                               -1.75

 2.00 Prom +  99813  99852   40                               -6.86
 2.01 Init + 100001 101094 1094  1  2   72   62  1309 0.007 120.59
 2.02 Intr + 108919 108991   73  1  1   83  100    78 0.318   7.91
 2.03 Intr + 154333 154421   89  0  2   83   98    57 0.633   4.97
 2.04 Intr + 155715 155925  211  0  1   76   96   169 0.699  15.42
 2.05 Intr + 157400 157519  120  1  0  122  109   116 0.999  17.79
 2.06 Intr + 159751 159876  126  0  0   67   42    75 0.696   1.68
 2.07 Term + 160396 160551  156  0  0   75   47   136 0.616   6.13
 2.08 PlyA + 165623 165628    6                                1.05

 3.28 PlyA - 166689 166684    6                                1.05
 3.27 Term - 167826 167738   89  1  2  130   47    74 0.993   5.32
 3.26 Intr - 168943 168835  109  1  1  108   86    24 0.993   4.06
 3.25 Intr - 170575 170417  159  1  0   85   72   134 0.997  11.68
 3.24 Intr - 171588 171434  155  1  2   89   91    -9 0.912  -0.71
 3.23 Intr - 172300 172132  169  1  1   47   94   158 0.912  11.82
 3.22 Intr - 172491 172387  105  0  0  104   61    68 0.968   6.11
 3.21 Intr - 172775 172641  135  2  0   46   99    85 0.986   6.16
 3.20 Intr - 173094 172936  159  0  0  104   72    83 0.960   8.48
 3.19 Intr - 174181 174031  151  0  1   70   99   188 0.960  18.26
 3.18 Intr - 175365 175073  293  0  2  105   85   236 0.994  20.93
 3.17 Intr - 175971 175831  141  0  0   75   73   105 0.996   8.25
 3.16 Intr - 176673 176550  124  2  1   82  100   157 0.999  16.89
 3.15 Intr - 177281 177223   59  2  2  110   62    19 0.443  -0.82
 3.14 Intr - 181497 181334  164  1  2   85   90     9 0.818   0.49
 3.13 Intr - 182881 182755  127  1  1   88  121    16 0.970   5.15
 3.12 Intr - 183712 183482  231  1  0   96   77   193 0.999  16.97
 3.11 Intr - 185960 185832  129  2  0   95  110    46 0.997   8.39
 3.10 Intr - 188308 188187  122  2  2   73  119   144 0.996  16.21
 3.09 Intr - 188522 188404  119  1  2   36  100    84 0.989   4.51
 3.08 Intr - 189390 189247  144  2  0   89   90    40 0.926   3.70
 3.07 Intr - 189664 189561  104  1  2   73   63    98 0.997   4.77
 3.06 Intr - 191169 191038  132  0  0  109   86    27 0.976   5.44
 3.05 Intr - 191739 191497  243  0  0   32   86    90 0.429   0.89
 3.04 Intr - 193785 193616  170  1  2   78  115    25 0.847   3.87
 3.03 Intr - 198250 198058  193  1  1   78   67    39 0.880  -0.13
 3.02 Intr - 199708 199572  137  2  2   88   82    67 0.963   6.39
 3.01 Intr - 200032 199849  184  1  1   72  116    22 0.680   2.76

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  50894  50810   85  1  1   57   38   168 0.933   5.83
S.002 Sngl + 100001 101167 1167  1  0   72   43  1350 0.933 125.01

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:236383165_236584328|GENSCAN_predicted_peptide_1|320_aa
MASPDDPLRAGSRINLSWTQRALFAEDGVRAFKAPSLTPPLAPHSKPIYLKFHCVDAINK
PHSWQDGIFKPYPPAEECDTITLNCPRNSDMKNQLRPDLKFGGLKVRRLDLALSLLPGDE
NLATAQLPDMGSALPAEFLHLPPAANPSSFSPRNTPCTSYLSREKKMAFQIALEILFQGG
EPVEPSGAAIISPEISKDNSCKENCTCSSCLLRAPTISDLLNDQDLLDVIRIKLDPCHPT
VKNWRNFASKWGMSYDELCFLEQRPQSPTLEFLLRNSQRTVGQLMELCRLYHRADVEKVL
RRWVDEEWPKRERGDPSRHF

>gi568815597f:236383165_236584328|GENSCAN_predicted_CDS_1|963_bp
atggcttcaccggacgaccctctgcgcgcaggaagcaggattaacttgtcctggacgcag
cgggcattgtttgcagaggatggtgtcagagcatttaaagcccccagcctcaccccaccc
ctggctccacactccaaacccatttacctgaaatttcactgtgttgatgctattaacaag
ccacactcctggcaggatggcatttttaagccgtatcctccagctgaagaatgtgataca
attactttgaactgcccacgaaattcagatatgaaaaatcagctcaggcctgatctgaaa
tttggaggccttaaagttaggcggttggacctggctctctccctcctccccggagatgag
aatttggccactgcacaattacctgatatgggcagtgcacttccagctgaatttttacac
ctccccccagcagccaaccccagctccttctctcccaggaatacaccttgcacctcttac
ctgagcagggagaagaaaatggctttccagatagcactggagatcctcttccagggtggg
gaacccgtagagccctcaggggcagctatcatcagcccagagatcagcaaggacaactcc
tgcaaagaaaactgtacttgttcctcctgcttgctccgggcccccaccataagtgacttg
ctcaatgatcaggacttactagacgtgatcaggataaagctggatccgtgtcacccaacg
gtgaaaaactggaggaattttgcaagcaaatgggggatgtcctatgacgaattgtgcttc
ctggagcagaggccacagagccccaccttggagttcttgctccggaacagtcagaggacg
gtgggccagctgatggagctctgcaggctctaccacagggccgacgtggagaaggttctg
cgcaggtgggtggacgaggagtggcccaagcgggagcgtggagacccctccaggcacttc
tag

>gi568815597f:236383165_236584328|GENSCAN_predicted_peptide_2|622_aa
MSILKIHARELFDSRGNPTVEVDLFTSEGLFRAAVPSGASTGIYEVLELQDNDKTRYMGK
GVSKPVEPINKTIAPVLVSKKLNVTEQEKIDKLMIEMDGTENKSKFGANAILGVSLAACK
ASAVEKGVPLYHHIADLSGNSKVILPVPVFNVINGSSHAVTKLAMQEFMVLPVGAANFRE
AMPIGAEVYHSLKNVIKEKYGKDATGVGDGGAFAPNILENKEGLELLKTAIGKAGYTDKV
IVSMDVEASEFFRSGKYDLEFKFLDDPTRYISPDCLADLYKSFIKNYPVVSTEDPFDQDD
WGAWQKFTASAGIQVVEDDLRVTNPKRTASAVNEKKCNCLLLKVNQIRSVTESLQACKLA
QANGCPVTPESSPEAGPRSFMEDVPQLTKVIPFVGTIPDQLDPGTLIVIRGHVPSDADRF
QVDLQNGSSMKPRADVAFHFNPRFKRAGCIVCNTLINEKWGREEITYDTPFKREKSFEIV
IMVLKDKFQVAVNGKHTLLYGHRIGPEKIDTLGIYGKVNIHSIGFSFSSPSNRGGDISKI
APRTVYTKSKDSTVNHTLTCTKIPPMNYVSKRLPFAARLNTPMGPGRTVVVKGEVNANAK
RSVSFGTSHSADTSVPVTAFHP

>gi568815597f:236383165_236584328|GENSCAN_predicted_CDS_2|1869_bp
atgtctattctcaagatccatgccagggagctctttgactctcgtgggaatcccactgtt
gaggttgatctcttcacctcagaaggtctcttcagagctgctgtgcccagtggtgcttca
actggtatctatgaggtcctagagctccaggacaatgataagactcgctatatggggaag
ggtgtctcaaagcctgttgagcccatcaataaaactattgcacctgtcctggttagcaag
aaactgaacgtcacagaacaagagaagattgacaaacttatgatagagatggatggaaca
gaaaataaatctaaatttggtgcaaatgccattctgggagtgtccctcgctgcctgcaaa
gctagtgctgttgagaagggggttcccctgtaccaccacatcgccgacttgtctggcaac
tccaaagtcatcttgccagtcccggtgttcaatgtcatcaatggcagttctcatgctgtc
accaagctggccatgcaggagttcatggtcctcccagtcggtgcagcaaacttcagggaa
gccatgcccattggagcggaggtttaccacagcctgaagaatgtcatcaaggagaaatat
gggaaagatgccaccggtgtgggggatggaggcgcgtttgctcccaacatcctggagaat
aaagaaggcctggagctgctgaagactgcgattgggaaagctggctacactgataaggtg
atcgtcagcatggacgtagaggcctccgagttcttcaggtctggaaagtatgacctggaa
ttcaagtttctcgacgaccccaccaggtacatctcacctgactgtctggctgacctgtac
aagtccttcatcaaaaactacccagtggtgtctactgaagatccctttgaccaggatgac
tggggagcttggcagaagttcacggccagtgcaggaatccaggtagtggaggatgatctc
agagtgaccaacccaaagaggacagcctcggccgtgaatgagaagaagtgcaactgcctc
ctgctcaaagtgaaccagattcgctctgtgactgagtcccttcaggcgtgcaagctggcc
caggccaatggttgcccagtgacccctgaatcttcccctgaggcaggtccccgaagcttc
atggaggatgttcctcagctgaccaaggtaatcccgtttgttggcaccattcctgatcag
ctggatcctggaactttgattgtgatacgtgggcatgttcctagtgacgcagacagattc
caggtggatctgcagaatggcagcagcatgaaacctcgagccgatgtggcctttcatttc
aatcctcgtttcaaaagggccggctgcattgtttgcaatactttgataaatgaaaaatgg
ggacgggaagagatcacctatgacacgcctttcaaaagagaaaagtcttttgagatcgtg
attatggtgctgaaggacaaattccaggtggctgtaaatggaaaacatactctgctctat
ggccacaggatcggcccagagaaaatagacactctgggcatttatggcaaagtgaatatt
cactcaattggttttagcttcagctcgcctagtaatagaggaggagacatttctaaaatc
gcacccagaactgtctacaccaagagcaaagattcgactgtcaatcacactttgacttgc
accaaaataccacctatgaactatgtgtcaaagaggctgccattcgctgcaaggttgaac
acccccatgggccctggacgaactgtcgtcgttaaaggagaagtgaatgcaaatgccaaa
aggtcagtatccttcggtaccagtcacagtgcagatacttccgtgcctgttaccgccttc
cacccgtga

>gi568815597f:236383165_236584328|GENSCAN_predicted_peptide_3|1348_aa
EIPSEWHIELMLDRGIPVELWAHYVEELNSTQRVAVEDSVFLVFSLKKFIYALKAPKSFP
KGDIWWNPEQLKEDSRDYLHLLIGLFEMMLNGADAVHFRVLMKLFIKVHLEDVFQLFKFC
SVLWTYGSSLSNPLNCSVKTVLQTQALYVGCAMLSSQKTQCKHQLASISSPVVTSLLINL
GSPVKEVRRAAIQCLQALSGVASPFYLIIDHLISKAEEITSDAAYVIQMVLSQLLPMAEQ
LLEKIQKEPTAVLKDEAMVLHLTLGKYNEFSVSLLNEDPKSLDIFIKAVHTTKELYAGMP
TIQITALEKITKPFFAAISDEKVQQKLLRMLFDLLVNCKNSHCAQTVSSVFKGISVNAEQ
VRIELEPPDKAKPLGTVQQKRRQKMQQKKSQDLESVQEVGGSYWQRVTLILELLQHKKKL
RSPQILVPTLFNLLSRCLEPLPQEQGNMEYTKQLILSCLLNICQKLSPDGGKIPKDILDE
EKFNVELIVQCIRLSEMPQTHHHALLLLGTVAGIFPDKVLHNIMSIFTFMGANVMRLDDT
YSFQVINKTVKMVIPALIQSDSGDSIEVSRNVEEIVVKIISVFVDALPHVPEHRRLPILV
QLVDTLGAEKFLWILLILLFEQYVTKTVLAAAYGEKDAILEADTEFWFSVCCEFSVQHQI
QSLMNILQYLLKLPEEKEETIPKAVSFNKSESQEEMLQVFNVETHTSKQLRHFKFLSVSF
MSQLLSSNNFLKKLPPQAGSSGGISSGSCVGIWLLETVLGYISAVAQSMERNADKLTVKF
WRALLSKAYDLLDKVNALLPTETFIPVIRGLVGNPLPSVRRKALDLLNNKLQQNISWKKT
IVTRFLKLVPDLLAIVQRKKKEGEEEQAINRQTALYTLKLLCKNFGAENPDPFVPVLNTA
VKLIAPERKEEKNVLGSALLCIAEVTSTLEALAIPQLPSLMPSLLTTMKNTSELVSSEVY
LLSALAALQKVVETLPHFISPYLEGILSQVIHLEKITSEMGSASQANIRLTSLKKTLATT
LAPRVLLPAIKKTYKQIEKNWKNHMGPFMSILQEHIGVMKKEELTSHQSQLTAFFLEALD
FRAQHSENDLEEVGKTENCIIDCLVAMVVKLSEVTFRPLFFKLFDWAKTEDAPKDRLLTF
YNLADCIAEKLKGLFTLFAGHLVKPFADTLNQVNISKTDEAFFDSENDPEKCCLLLQFIL
NCLYKIFLFDTQHFISKERAEALMMPLVDQLENRLGGEEKFQERVTKHLIPCIAQFSVAM
ADDSLWKPLNYQILLKTRDSSPKVRFAALITVLALAEKLKENYIVLLPESIPFLAELMED
ECEEVEHQCQKTIQQLETVLGEPLQSYF

>gi568815597f:236383165_236584328|GENSCAN_predicted_CDS_3|4047_bp
gaaatcccctcagaatggcacattgaactgatgttagacagagggatcccagtggagctg
tgggcacattatgtagaagagctcaacagcactcagagggtggccgtggaggactcggtt
tttcttgtattttccttgaaaaaatttatttatgcactgaaagctcctaaatcttttcct
aaaggtgatatatggtggaatcctgaacaactgaaagaagacagcagggactatctgcac
ttgctcattgggctgtttgagatgatgctcaatggtgccgatgctgttcatttcagagtt
ctgatgaaacttttcataaaggtgcatctagaagatgtttttcagttattcaagttctgt
tctgttttatggacctatggttctagcctttcaaatccactaaactgcagtgtgaaaaca
gtgctgcagactcaagctctttatgtgggctgtgcaatgctttcttctcagaagacacag
tgtaaacaccaactggcatccatatcttctccagtggtgacatctttactcattaacctg
ggaagccccgtaaaagaagttcgtagggctgccattcagtgtctccaggccctcagtgga
gtggcatccccgttttatctgataatagatcatttgatttctaaagcagaggagatcact
tcagatgctgcctatgttattcagatggtgctttctcagctattgcctatggctgaacaa
ctgctagaaaagatccagaaggagcccacagctgtgctgaaagatgaggccatggttctg
catctcactctgggaaagtataatgaattttcagtttcccttttaaatgaggatccgaag
agtctagatatatttataaaagctgtgcacacaacaaaggaactttacgcgggaatgcca
accattcagatcacagcccttgaaaagattacaaaaccattttttgcagccatatcagat
gaaaaagttcagcagaagcttttaagaatgttgtttgatttattggtgaactgtaaaaac
tcacattgtgctcagactgtcagcagtgtttttaaagggatttccgttaatgctgaacaa
gtccgaatagaactggagccaccagataaagctaaacccttgggcacagttcagcaaaaa
agaaggcaaaaaatgcagcagaaaaaatcacaagatctagaatctgttcaggaagttgga
ggttcttactggcaaagagtaactctcatcctggaattactgcagcacaaaaagaagctc
agaagtcctcagatattggtgccaactctttttaacttgctatcaagatgtttagaaccc
ttgccacaagagcagggaaatatggaatacaccaaacaattaattcttagttgtctgctc
aacatctgccaaaaactatctccagatggtggcaaaatacccaaagatattttagatgag
gagaagttcaacgtggagttgatagttcagtgcatccgcctttcggagatgccgcagacc
catcaccatgcccttttacttttgggcactgttgctggaatatttccggataaagtttta
cacaatatcatgtctatttttacatttatgggagccaatgtcatgcgcctagatgatact
tacagttttcaagttattaacaagacagtgaaaatggttattcccgcacttattcagtct
gatagtggagattctatagaagtttcaagaaacgttgaagagattgtggtaaaaatcatt
agtgtatttgtggatgcgctgccacacgtcccggagcacaggcgcctgcccatccttgtt
caacttgttgatacactgggtgcagagaaattcctctggattctcctcatcttgcttttt
gaacagtatgtcacaaaaacagtgctggcggctgcctatggcgaaaaggatgctatttta
gaagcagacactgaattttggttttcagtctgttgtgagtttagtgtccagcatcagata
caaagcttgatgaatatcctccagtacttactaaagctgccagaggaaaaagaagaaacc
attcccaaagcagtgtcatttaataagagtgaatcacaagaagaaatgctacaggttttt
aatgtagagactcacactagcaagcaactgcggcattttaaatttttgtcagtgtccttc
atgtctcagctcctgtcttccaataattttctgaaaaagctcccccctcaggctggcagc
agtggggggatctccagtggcagctgtgttggaatttggttgctggagaccgttctcggc
tatatcagtgcagttgcacagtccatggaaaggaacgcagacaaactcaccgtgaagttc
tggcgcgcgctccttagtaaagcttacgacctgttagataaggtcaatgccttgctgccc
acagagacattcattcctgtgatcagagggctggtgggcaatcccctgccatctgttcgc
cgcaaagcgctggaccttttgaataacaagctgcagcaaaatatatcctggaagaagaca
atagttacccgtttcctaaaactggttccagaccttttggccattgtgcagcgtaagaaa
aaggaaggggaagaagaacaagcaatcaacagacagacagcgttgtataccttaaagctt
ttatgcaagaattttggtgcagaaaatccagatccttttgtcccagtgctgaacactgct
gtgaaactgattgctccagagagaaaggaggagaagaatgtcctgggaagcgcgctgctg
tgcatagcagaggtgacctccaccctggaggcgctggccatcccccagcttcccagcctg
atgccatcgttgctgacaacaatgaagaacaccagcgagctggtctccagcgaggtctac
ctgctcagtgccttggctgctctgcagaaggttgtggagactctcccgcacttcatcagc
ccctatctggaaggcattctctcccaggtgattcatctggagaaaatcactagtgaaatg
ggttctgcgtcacaggctaatatccgtctcacatctcttaaaaagacactggctaccaca
cttgcaccccgagtcctgttgcccgccatcaaaaaaacttacaagcagattgagaagaac
tggaagaatcacatgggtccgtttatgagcatcttgcaagagcatattggggtgatgaag
aaggaagagctcacctcccatcagtctcagctaaccgcctttttcctggaagccctggac
ttccgagcccagcactctgagaacgatctggaggaagttggaaaaacggaaaattgtatc
attgactgtctagtagccatggttgtcaaactttccgaggtcacattcaggcccctgttc
ttcaagctgtttgattgggctaaaacagaagatgccccaaaggacaggttgttgacattt
tacaacttggcagattgcattgctgaaaagctgaaagggctttttactctgtttgccggc
cacttagtgaagccttttgctgacaccttgaaccaggtgaacatctccaaaacagatgaa
gcattttttgactctgaaaatgaccctgaaaagtgctgcttgctgttgcagtttattttg
aactgtttatacaaaatcttcctttttgatacccagcattttataagtaaagagagagca
gaagccttgatgatgcctctggtggatcagctggaaaacaggcttgggggagaagagaaa
ttccaggaacgggtgacaaagcacctgataccatgcatcgcacagttttcggtggccatg
gcggatgactctctttggaaaccactgaactaccagattctgctaaagacgagagactcc
tcgcctaaggttcgatttgctgctttgattactgtgttagcactggctgaaaaactaaag
gagaattatattgtcttgctaccagaatccattcctttcttagcagagttgatggaagat
gaatgtgaagaagtagaacatcagtgccaaaagactattcagcaactggaaactgtcctg
ggagagccactccagagctatttctaa