GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:31:12 Sequence gi568815592f:37070576_37274088 : 203513 bp : 45.66% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 1092 1131 40 -2.86 1.01 Sngl + 7123 7650 528 0 0 83 36 235 0.687 12.06 1.02 PlyA + 7964 7969 6 1.05 2.00 Prom + 20622 20661 40 -5.76 2.01 Sngl + 20737 21231 495 0 0 82 47 558 0.983 47.15 2.02 PlyA + 21251 21256 6 1.05 3.00 Prom + 35185 35224 40 -4.36 3.01 Init + 41422 41446 25 0 1 64 111 15 0.071 1.09 3.02 Term + 60654 60892 239 1 2 127 41 112 0.899 6.73 3.03 PlyA + 61495 61500 6 1.05 4.00 Prom + 62063 62102 40 -4.46 4.01 Init + 65086 65088 3 0 0 87 101 0 0.032 1.20 4.02 Intr + 99614 100082 469 1 1 21 97 516 0.786 38.48 4.03 Intr + 100198 100304 107 2 2 113 117 215 0.991 26.83 4.04 Intr + 100367 100456 90 1 0 -24 105 174 0.898 8.09 4.05 Intr + 100550 100916 367 1 1 119 89 840 0.978 81.92 4.06 Intr + 102421 102597 177 2 0 81 93 152 0.965 14.89 4.07 Term + 103359 103516 158 1 2 132 43 28 0.910 0.80 4.08 PlyA + 104789 104794 6 1.05 5.03 PlyA - 104903 104898 6 1.05 5.02 Term - 142383 141882 502 2 1 -38 42 593 0.003 36.05 5.01 Init - 148455 147851 605 0 2 65 92 414 0.058 34.28 5.00 Prom - 150341 150302 40 -11.82 6.00 Prom + 150370 150409 40 -4.76 6.01 Init + 151741 152218 478 0 1 72 117 140 0.590 11.12 6.02 Term + 175073 175191 119 0 2 -21 38 324 0.846 15.20 6.03 PlyA + 176793 176798 6 1.05 7.03 PlyA - 180622 180617 6 1.05 7.02 Term - 187221 187103 119 1 2 35 48 189 0.778 8.30 7.01 Intr - 187865 187835 31 2 1 87 114 -2 0.226 0.00 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:37070576_37274088|GENSCAN_predicted_peptide_1|175_aa MVLRLLPVPALPKSKSTVLGVLSTCLSARPAPFSLKGLCQPGHDSAASRGARASPASKQH TVCGQPPTTFPLPDVSEEPWLPWSPHGQPCPDMPLPLCCPALHSVSRRPDVPALVLAALN EVQMCSLARASSGCYQRRGRPCEGAARLQLAAAVLEEECSGKDSHWNVGSRDVSA >gi568815592f:37070576_37274088|GENSCAN_predicted_CDS_1|528_bp atggtcctgagactgctcccagttccagccctccccaagagcaagagcactgtgcttgga gtcctgagtacctgtctctctgccagacctgcccccttcagcctaaaaggcctctgccag ccagggcacgactctgctgcatccagaggggccagagccagccccgccagcaagcagcac accgtctgcggccagccgcccaccaccttcccacttccagatgtctcagaggagccctgg ctgccttggagcccgcatgggcagccctgtcctgacatgcccctgcccctttgctgccct gccttgcactctgtctcgcggcggccagatgtgccggctttggtgctggcggctcttaat gaggtccagatgtgcagcttggcccgggccagcagcggctgctaccaaaggagagggagg ccctgtgagggagcagccaggctgcagctggccgcggcggtgctggaggaggaatgctct gggaaggattcccactggaatgtgggctctcgggatgtctctgcctaa >gi568815592f:37070576_37274088|GENSCAN_predicted_peptide_2|164_aa MWPKFNPSEIKVVYLRCTGGEVSATSALGPKIGPLDLSPKKVGDDIAKATGDWKGLRITV KLTIENRQAQIEVVPSASALIIKALKEPRDRKKQKNIKHSGNITFDEIVNIAPRMRHRSL ARDLTGTIKEILGTAQSVGCNVDGRHPHDIIDDINSGAVECPAS >gi568815592f:37070576_37274088|GENSCAN_predicted_CDS_2|495_bp atgtggccgaagttcaaccccagcgagatcaaagtcgtatacctgaggtgcactgggggt gaagtcagtgccacgtctgcgctgggccccaagatcggccccctggacctgtctccaaaa aaggttggtgatgacattgccaaggcaacgggtgactggaagggcctgaggattacagtg aaactgaccattgagaacagacaggcccagattgaggtggtgccttctgcttctgccctg atcatcaaagcccttaaggaaccaagagacagaaagaaacagaaaaacattaaacacagt gggaatatcacttttgatgagatcgtcaacattgctccacggatgcggcaccgatcttta gccagagatctcactggaaccattaaagagatcctggggactgcccagtctgtgggctgc aatgttgatggccgccaccctcatgacatcatagatgacatcaacagtggtgctgtggaa tgcccagctagttaa >gi568815592f:37070576_37274088|GENSCAN_predicted_peptide_3|87_aa MTIIGVSFGLLRAGSATGINKANNRTLHGHPSSRPESRGLRRLLEDLEQVHITCCHHKCC PGFCSLCPVTEEFLNGSSELTKKKMSP >gi568815592f:37070576_37274088|GENSCAN_predicted_CDS_3|264_bp atgacaatcattggtgtgtcctttgggcttctaagagctggcagcgccacagggataaat aaggccaacaaccgcaccttacatggccatcccagcagccgcccggaatcccgaggatta cgcagacttctggaggacttggaacaagtgcacatcacctgctgccaccacaagtgttgc ccgggcttctgcagcctctgtcctgtgactgaggaatttttaaatggaagctctgagctc acaaagaagaaaatgtctccctga >gi568815592f:37070576_37274088|GENSCAN_predicted_peptide_4|456_aa MRPRWLRRPERSRWQRRRRDRQQQQQQQQQQPLASCPAALPHEPHEPLTPPFSALPDPAG APSRRQSRQRPQLSSDSPSAFRASRSHSRNATRSHSHSHSPRHSLRHSPGSGSCGSSSGH RPCADILEVGMLLSKINSLAHLRAAPCNDLHATKLAPGKEKEPLESQYQVGPLLGSGGFG SVYSGIRVSDNLPPGRGNLTETLGFQVAIKHVEKDRISDWGELPNGTRVPMEVVLLKKVS SGFSGVIRLLDWFERPDSFVLILERPEPVQDLFDFITERGALQEELARSFFWQVLEAVRH CHNCGVLHRDIKDENILIDLNRGELKLIDFGSGALLKDTVYTDFDGTRVYSPPEWIRYHR YHGRSAAVWSLGILLYDMVCGDIPFEHDEEIIRGQVFFRQRVSSECQHLIRWCLALRPSD RPTFEEIQNHPWMQDVLLPQETAEIHLHSLSPGPSK >gi568815592f:37070576_37274088|GENSCAN_predicted_CDS_4|1371_bp atgcggccgcggtggctgaggaggcccgagaggagtcggtggcagcggcggcggcgggac cggcagcagcagcagcagcagcagcagcagcaaccactagcctcctgccccgcggcgctg ccgcacgagccccacgagccgctcaccccgccgttctcagcgctgcccgaccccgctggc gcgccctcccgccgccagtcccggcagcgccctcagttgtcctccgactcgccctcggcc ttccgcgccagccgcagccacagccgcaacgccacccgcagccacagccacagccacagc cccaggcatagccttcggcacagccccggctccggctcctgcggcagctcctctgggcac cgtccctgcgccgacatcctggaggttgggatgctcttgtccaaaatcaactcgcttgcc cacctgcgcgccgcgccctgcaacgacctgcacgccaccaagctggcgcccggcaaggag aaggagcccctggagtcgcagtaccaggtgggcccgctactgggcagcggcggcttcggc tcggtctactcaggcatccgcgtctccgacaacttgccgcccggacgagggaacctgacg gagaccctgggcttccaggtggccatcaaacacgtggagaaggaccggatttccgactgg ggagagctgcctaatggcactcgagtgcccatggaagtggtcctgctgaagaaggtgagc tcgggtttctccggcgtcattaggctcctggactggttcgagaggcccgacagtttcgtc ctgatcctggagaggcccgagccggtgcaagatctcttcgacttcatcacggaaagggga gccctgcaagaggagctggcccgcagcttcttctggcaggtgctggaggccgtgcggcac tgccacaactgcggggtgctccaccgcgacatcaaggacgaaaacatccttatcgacctc aatcgcggcgagctcaagctcatcgacttcgggtcgggggcgctgctcaaggacaccgtc tacacggacttcgatgggacccgagtgtatagccctccagagtggatccgctaccatcgc taccatggcaggtcggcggcagtctggtccctggggatcctgctgtatgatatggtgtgt ggagatattcctttcgagcatgacgaagagatcatcaggggccaggttttcttcaggcag agggtctcttcagaatgtcagcatctcattagatggtgcttggccctgagaccatcagat aggccaaccttcgaagaaatccagaaccatccatggatgcaagatgttctcctgccccag gaaactgctgagatccacctccacagcctgtcgccggggcccagcaaatag >gi568815592f:37070576_37274088|GENSCAN_predicted_peptide_5|368_aa MKQQQWCGMTAKMGTVLSGVFTIMAVDMYLIFEQKHLGNGSCTEITPKYRGASNIINNFI ICWSFKIVLFLSFITILISCFLLYSVYAQIFRGLVIYIVWIFFYETANVVIQILTNNDFD IKEVRIMRWFGLVSRTVMHCFWMFFVINYAHITYKNRSQGNIISYKRRISTAEILHSRNK RLSISSGFSGSHLESQYFERQRMFSLMVGIFSVLNTTQFFIFDLNQKTHICYEAKFSIYV DSKSELVTWTLFHRANISTGLSLTTIIIGCFLFYCIHKNIYMGLLIYAMWIITYELINFS IVLLLNGIIKDHFKTLSYLHWIFQISHMLLHFFCLPFIVKHAYNLYKESQTVGRKRRHRL CSTIAVNS >gi568815592f:37070576_37274088|GENSCAN_predicted_CDS_5|1107_bp atgaaacagcagcagtggtgtgggatgactgccaaaatgggcaccgtgttgtcaggggtc ttcaccatcatggccgtagacatgtatctcatctttgaacagaagcacctagggaatggc agttgcactgagatcacaccaaagtacaggggtgcaagtaacatcataaataacttcatc atctgctggagttttaaaatcgtcctcttcctgtctttcatcaccatcctcatcagctgc ttcctcctgtactcagtgtatgcccagatcttcaggggcctggtcatctacattgtctgg atttttttctatgaaactgcaaacgtcgtaatacaaatcctcaccaacaatgactttgac attaaagaggtcagaatcatgcgctggtttggcttggtgtctcgtacagtcatgcactgt ttctggatgttctttgtcatcaactatgcccacataacctacaaaaaccggagccagggc aatataatttcctacaagagacgaatttctacagcggagattctccacagcagaaataaa agattatcaatttcgagtgggttcagtggctcacacctggaatcccagtactttgagagg cagaggatgttctccctcatggtgggcatcttctctgtccttaataccacccagttcttc atctttgacctgaaccagaagacacacatttgctatgaggccaagttcagcatctacgtg gactcaaagtcggagctagtcacttggaccctgttccacagggctaatatcagcactggc ctctccctcaccaccatcatcatcggctgcttcctcttttattgtatccacaagaatatc tacatggggctgctgatctatgccatgtggatcatcacttacgagctcatcaacttctcc atagtcctgctcctcaacgggatcatcaaagatcacttcaagacgctgagttatttgcac tggatcttccaaatctcacacatgctcctgcactttttctgtctgcccttcatcgtcaag catgcatacaacctttacaaggaatcccagactgtgggcaggaaacgccgccacaggctc tgctccaccattgcagtgaactcatga >gi568815592f:37070576_37274088|GENSCAN_predicted_peptide_6|198_aa MAPPGLDPNPAPRFELVPGLERGQAAGADIREPAGTAGSRGRGPPGVPEGAGCRDARVLH LGGPAPLTRKGRVSYLSLAPACSVKWEAQVCSRGSWQLQLHLGGQILPIPGPPPRAQGGF DAQPQFGQLQPAQEGGTSACSVEPEVGSTALVWAAAVAPAKTPTQNNSDNKEEEEQEEEE EEEQEEDEEEEEMEEEIY >gi568815592f:37070576_37274088|GENSCAN_predicted_CDS_6|597_bp atggcccccccagggctcgaccccaacccggctccgagatttgaactggtgccgggattg gagagaggccaggcagcgggagcagacatccgggagcctgcagggacggcaggaagccgg ggaagggggcctcctggggtccccgagggtgcaggctgcagagacgcccgggtcctgcac ctaggaggtcccgccccgctaactcgaaaggggcgggtctcctacttgtccctggctcct gcctgctcagtgaagtgggaggcccaggtctgcagccgtgggtcctggcagctgcagctg cacctgggagggcagatcctgcctattcccggccctcctccaagagcacagggaggcttc gatgcacagccacagtttgggcagctgcagcccgcccaagagggcgggacttctgcctgc tccgtagagccggaggtggggtctacggctttggtttgggcggctgcagtagcacccgca aaaacaccaacacagaacaacagtgacaataaggaagaggaggagcaggaggaggaggag gaggaagagcaggaggaggacgaggaggaggaggagatggaggaggagatctactag >gi568815592f:37070576_37274088|GENSCAN_predicted_peptide_7|49_aa AAPNSHRNPSAPGDAILDVGTPPPSTAAIGELRKPWTGRPIGSRNPAGR >gi568815592f:37070576_37274088|GENSCAN_predicted_CDS_7|150_bp gcggccccaaactcccaccggaatccttcggctcctggggacgccatcttggatgtgggc acccctccacccagcaccgctgccattggtgagctgcgcaagccgtggacgggccgacca attgggagccggaatcctgcaggccgctga