GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:53:53 Sequence gi568815589r:122141887_122361492 : 219606 bp : 45.32% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 217 212 6 1.05 1.04 Term - 2492 2355 138 2 0 110 43 50 0.702 0.66 1.03 Intr - 6391 6226 166 0 1 68 93 111 0.900 9.56 1.02 Intr - 10522 10359 164 1 2 112 66 137 0.991 12.67 1.01 Init - 17791 17741 51 1 0 103 78 73 0.962 8.96 1.00 Prom - 18368 18329 40 -8.16 2.00 Prom + 21473 21512 40 -4.36 2.01 Init + 22696 22761 66 0 0 64 116 41 0.614 5.57 2.02 Intr + 23832 23932 101 2 2 68 51 19 0.477 -4.89 2.03 Intr + 24882 25029 148 0 1 114 53 102 0.714 9.54 2.04 Intr + 27759 27870 112 2 1 126 80 197 0.995 22.65 2.05 Intr + 32610 32741 132 1 0 80 96 115 0.990 12.12 2.06 Intr + 46582 46714 133 2 1 64 20 113 0.080 1.80 2.07 Term + 50170 50329 160 1 1 105 45 67 0.100 1.51 2.08 PlyA + 52228 52233 6 1.05 3.20 PlyA - 52743 52738 6 1.05 3.19 Term - 62894 62815 80 0 2 143 50 64 0.915 6.03 3.18 Intr - 63244 63187 58 1 1 79 56 28 0.555 -2.84 3.17 Intr - 67831 67728 104 2 2 103 94 98 0.729 11.79 3.16 Intr - 71894 71720 175 2 1 44 89 185 0.983 13.71 3.15 Intr - 72183 72088 96 0 0 129 75 197 0.998 22.71 3.14 Intr - 72550 72397 154 0 1 105 86 125 0.526 14.07 3.13 Intr - 72883 72832 52 2 1 82 44 30 0.457 -4.03 3.12 Intr - 75402 75182 221 2 2 78 32 511 0.930 42.45 3.11 Intr - 78060 77919 142 1 1 46 18 148 0.580 3.01 3.10 Intr - 78673 78528 146 0 2 23 68 71 0.287 -1.47 3.09 Intr - 84611 84490 122 2 2 116 96 176 0.987 20.49 3.08 Intr - 85144 84962 183 1 0 121 100 162 0.977 20.68 3.07 Intr - 85594 85523 72 1 0 110 101 18 0.958 4.90 3.06 Intr - 86939 86771 169 1 1 102 99 202 0.551 22.65 3.05 Intr - 104935 104718 218 1 2 54 89 105 0.673 4.50 3.04 Intr - 105718 105632 87 1 0 112 102 83 0.979 12.27 3.03 Intr - 106894 106796 99 1 0 87 7 82 0.388 0.31 3.02 Intr - 110087 109961 127 1 1 118 91 83 0.982 12.28 3.01 Init - 119606 119494 113 2 2 32 91 93 0.431 3.78 3.00 Prom - 119833 119794 40 -6.16 4.00 Prom + 119989 120028 40 -5.96 4.01 Init + 122630 122769 140 1 2 81 58 89 0.208 4.90 4.02 Intr + 128978 129189 212 2 2 64 75 78 0.044 2.66 4.03 Intr + 138557 138712 156 0 0 59 92 74 0.634 4.88 4.04 Intr + 143283 143401 119 1 2 87 94 111 0.989 11.78 4.05 Intr + 149863 149954 92 0 2 87 100 75 0.632 7.39 4.06 Intr + 171341 171500 160 2 1 77 79 199 0.643 17.89 4.07 Term + 180654 180731 78 2 0 108 48 73 0.884 3.06 4.08 PlyA + 181155 181160 6 1.05 5.05 PlyA - 181715 181710 6 1.05 5.04 Term - 200115 200021 95 1 2 15 49 119 0.408 -1.31 5.03 Intr - 200623 200570 54 2 0 56 80 56 0.405 0.55 5.02 Intr - 201358 201273 86 0 2 25 89 67 0.571 -0.04 5.01 Init - 205196 204829 368 2 2 67 43 291 0.557 17.00 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 53825 53768 58 2 1 48 111 40 0.818 3.77 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:122141887_122361492|GENSCAN_predicted_peptide_1|172_aa MPGIVELPTLEELKVDEVKISSAVLKAAAHHYGAQCDKPNKEFMLCRWEEKDPRRCLEEG KLVNKCALDFFRQIKRHCAEPFTEYWTCIDYTGQQLFRHCRKQQAKFDECVLDKLGWVRP DLGELSKVTKVKTDRPLPENPYHSRPRPDPSPEIEGDLQPATHGSRFYFWTK >gi568815589r:122141887_122361492|GENSCAN_predicted_CDS_1|519_bp atgccggggatagtggagctgcccactctagaggagctgaaagtagatgaggtgaaaatt agttctgctgtgcttaaagctgcggcccatcactatggagctcaatgtgataagcccaac aaggagtttatgctctgccgctgggaagagaaagatccgaggcggtgtttagaggaaggc aaactggtcaacaagtgtgctttggacttctttaggcagataaaacgtcactgtgcagag ccttttacagaatattggacttgcattgattatactggccagcagttatttcgtcactgt cgcaaacagcaggcaaagtttgacgagtgtgtgctggacaaactgggctgggtgcggcct gacctgggagaactgtcaaaggtcaccaaagtgaaaacagatcgacctttaccggagaat ccctatcactcaagaccaagaccggatcccagccctgagatcgagggagatctgcagcct gccacacatggcagccgcttttatttctggaccaagtaa >gi568815589r:122141887_122361492|GENSCAN_predicted_peptide_2|283_aa MSSVLGELEMSVRGLNSDVQFKTVSPPRTGTASALITVESVLYSASWTISIVPDKMMEGK AKYILPTETIYVGEMKDGMFHGEGTLYFPSGSQYDAIWENGLAIKGTYTFSDGLHYDEKN WHYCDGYDRRFYTEILNGLKPAGMAQLTNMDPPRKIPKGYYDCGDGFYNPVTRVVKDYRN RFLRNAVLIQPSTFANWTTGSRLPGPLRSIVHLQPGIRGQKKQRVNAEAVGKSGETCYKD EEGGSEMGVTCPKSPSESVAQHRIQACSAPELLLVFAVLPHEP >gi568815589r:122141887_122361492|GENSCAN_predicted_CDS_2|852_bp atgagttctgttttaggcgagttggagatgtctgtgagaggtctgaacagtgatgtccag ttcaagactgtgagccccccgaggacagggaccgcatctgccttgatcactgttgaatct gtgttgtattcagcaagttggaccatcagcatagtccctgacaaaatgatggagggcaaa gccaagtacatcctccctaccgaaacaatatatgttggggaaatgaaggatggcatgttt cacggcgagggaaccctgtacttccccagcggaagccaatacgacgccatttgggaaaac ggattggccataaagggcacatatacgttctcagatgggctgcactatgatgagaaaaac tggcattactgcgacggctatgatcggaggttttacacagagatcctcaatggcttgaag cctgcaggtatggctcaactcaccaatatggacccacctagaaaaatccccaagggctat tacgattgtggagacggcttctataacccagtcacgagggtagtcaaggactataggaac cgctttctaagaaacgcagtcttaattcagccttcaacatttgccaactggacaacaggc tcccgcctgccggggccccttcggtccatcgtccacttgcagccagggattaggggccaa aagaagcagagggtcaacgcagaggctgtgggaaagtctggtgaaacctgctacaaagac gaggagggtggctcagagatgggagtgacgtgtccaaaatcacccagcgagtcggtggct cagcatcggatccaggcttgttcggctccagaactccttctagtctttgctgtgttgcca catgagccctga >gi568815589r:122141887_122361492|GENSCAN_predicted_peptide_3|805_aa MEAETKTLPLENASILSEGSLQEGHRLWIGNLDPKITEYHLLKLLQKFGKVKQFDFLFHK SGALEGQPRGYCFVNFETKQGFFTLTKHGLHLAECLMDANGAKKVWQDGKYSQEAEQAIQ CLNGKLALSKKLVVRWAHAQVKWGLEKPACTKICDSFELELKSLALSMVFDQRLSTPLPN KLPSKESQRLNKTFKLNVQAEKLSSDTLDPSEYFSSLCSHCGCGPPRRSSPRCAPAEPEV PRPMYWKHENAAPALPEGCRLPAEGGPATDQVMAQPGSGCKATTRCLEGTAPPAMAQSDA EALAGALDKDEGQASPCTPSTPSVCSPPSAASSVPSAGKNICSSCGLEILDRYLLKVNNL IWHVRCLECSVCRTSLRQQNSCYIKNKEIFCKMDYFSFVVSAADFAVSGIQWLRPRSPPG KESVVLSEPGSISLLAPFVLLTPVLGAWALGPALEIPESAARREDAALPDQPRFLFIHAG FQLSSSPVQLLRSRFGTKCARCGRQIYASDWVRRARGNAYHLACFACFSCKRQLSTGEEF GLVEEKVLCRIHYDTMIENLKRAAENAFGVPQFRQQGAQFGTLGLPLEEMGTRGQLLLSP PGNGLTLEGAVPSEQDSQPKPAKRARTSFTAEQLQVMQAQFAQDNNPDAQTLQKLADMTG LSRRVIQVWFQNCRARHKKHTPQHPVPPSGAPPSRLPSALSDDIHYTPFSSPERARMVTL HGYIESQVQCGQVHCRLPYTAPPVHLKADMDGPLSNRGEKNVGTPSRGYGAASVKSGTEG HPFSVLTLPALPHLPVGAPQLPLSR >gi568815589r:122141887_122361492|GENSCAN_predicted_CDS_3|2418_bp atggaagcagaaaccaaaactcttcccctggagaatgcatccatcctttcagagggctct ctgcaggaaggacaccgattatggattggcaacctggaccccaaaattaccgaataccac ctcctcaagctcctccagaagtttggcaaggtaaagcagtttgacttcctcttccacaag tcaggtgctttggagggacagcctcgaggctactgttttgttaactttgaaactaagcag ggcttcttcaccctcactaaacatgggttacacttggctgaatgcctcatggatgcaaat ggggcaaagaaggtttggcaggatggaaagtacagccaggaagcagagcaagccatccag tgtctcaatggcaagttggccctgtccaagaagctggtggtgcgatgggcacatgctcaa gtaaagtggggacttgaaaaacctgcctgcacaaagatttgtgacagttttgagttggag ctgaaatctttagctctaagcatggtttttgatcagaggctctctaccccactccccaac aaacttcccagtaaagagtcacagagactaaacaagacatttaagctcaatgtccaagct gaaaaactaagttcagatactttagatccttctgagtatttcagcagcctctgctcccac tgcggctgtggtccccctcggcgcagctctccgcgctgcgcgcccgctgagcccgaggtt ccccggcccatgtactggaagcatgagaacgccgccccggcgttgcccgagggctgccgg ctgccggccgagggcggccccgccaccgaccaggtgatggcccagccagggtccggctgc aaagcgaccacccgctgtcttgaagggaccgcgccgcccgccatggctcagtctgacgcc gaggccctggcaggagctctggacaaggacgagggtcaggcctccccatgtacgcccagc acgccatctgtctgctcaccgccctctgccgcctcctccgtgccgtctgcaggcaagaac atctgctccagctgcggcctcgagatcctggaccgatatctgctcaaggtcaacaacctc atctggcacgtgcggtgcctcgagtgctccgtgtgtcgcacgtcgctgaggcagcagaac agctgctacatcaagaacaaggagatcttctgcaagatggactacttcagtttcgttgtc tccgccgccgatttcgctgtgtctggaatccagtggctccggcctcgcagcccgcctggg aaggagtctgtcgttctgtcggagcctggatcgataagcctcctcgcgcccttcgtgctg ctgactcctgtcctgggagcgtgggcccttggccccgcgctggagattccagaatcagcg gcccggagagaggacgcggcgctgccagaccagccccgcttcctgttcatccatgcggga ttccagctttcctcgtcgccagtccagttgctccgcagccgattcgggaccaagtgtgcc cggtgcggccgacagatctacgccagcgactgggtgcggagagctcgcggcaacgcctac cacctggcctgcttcgcctgcttctcgtgcaagcgccagctgtccactggtgaggagttc ggcctggtcgaggagaaggtgctctgccgcatccactacgacaccatgattgagaacctc aagagggccgccgagaacgcctttggtgtcccccagtttcgccagcagggagcccagttt gggacattagggctgccactagaagagatggggacgcgtggtcagctgcttctatcgccc ccagggaacggcctcacgttggagggggcagtgccctcggaacaggacagtcaacccaag ccggccaagcgcgcgcggacgtccttcaccgcggaacagctgcaggttatgcaggcgcag ttcgcgcaggacaacaaccccgacgctcagacgctgcagaagctggcggacatgacgggc ctcagccggagagtcatccaggtgtggtttcaaaactgccgggcgcgtcataaaaagcac acgccgcaacacccagtgccgccctcgggggcgcccccgtcccgccttccctccgccctg tccgacgacatccactacaccccgttcagcagccccgagcgggcgcgcatggtcaccctg cacggctacattgagagtcaggtacagtgcgggcaggtgcactgccggctgccttacacc gcaccccccgtccacctcaaagccgatatggatgggccgctctccaaccggggtgagaag aacgttggaactccctcacggggctatggtgcggcttccgtgaaatcaggcacagaaggt catcctttttcagtactaacgctgccggcacttccgcatctgcccgtgggcgccccacag ctgcccctcagccgctga >gi568815589r:122141887_122361492|GENSCAN_predicted_peptide_4|318_aa MPGTLASRFLAANSYRRRVITRIVISGPATGQSGAGPEMQSCEGGLLGCFQGLSSVMALG LKCFRMVHPTFRNYLAASIRPVSEVTLKTVHERQHGHRQYMAYSAVPVRHFATKKAKAKG KGQSQTRVNINAALVEDIINLEEVNEEMKSVIEALKDNFNKTLNIRTSPGSLDKIAVVTA DGKLALNQISQISMKSPQLILVNMASFPECTAAAIKAIRESGMNLNPEVEGTLIRVPIPQ VTREHREMLVKLAKQNTNKAKDSLRKVRTNSMNKLKKSKDTVSEDTIRLIEKQISQMADD TVAELDRHLAVKTKELLG >gi568815589r:122141887_122361492|GENSCAN_predicted_CDS_4|957_bp atgcctggaaccctggcttcccgatttctggctgcgaattcctacaggcgaagggtcatt actcggattgttatttccgggccagcgacaggacaatccggggcggggcctgagatgcag tcctgcgagggcggcctccttggatgtttccaaggattgtcttcagtcatggccttggga ttaaagtgcttccgcatggtccaccctacctttcgcaattatcttgcagcctctatcaga cccgtttcagaagttacactgaagacagtgcatgaaagacaacatggccataggcaatac atggcctattcagctgtaccagtccgccattttgctaccaagaaagccaaagccaaaggg aaaggacagtcccaaaccagagtgaatattaatgctgccttggttgaggatataatcaac ttggaagaggtgaatgaagaaatgaagtctgtgatagaagctctcaaggataatttcaat aagactctcaatataaggacctcaccaggatcccttgacaagattgctgtggtaactgct gacgggaagcttgctttaaaccagattagccagatctccatgaagtcgccacagctgatt ttggtgaatatggccagcttcccagagtgtacagctgcagctatcaaggctataagagaa agtggaatgaatctgaacccagaagtggaagggacgctaattcgggtacccattccccaa gtaaccagagagcacagagaaatgctggtgaaactggccaaacagaacaccaacaaggcc aaagactctttacggaaggttcgcaccaactcaatgaacaagctgaagaaatccaaggat acagtctcagaggacaccattaggctaatagagaaacagatcagccaaatggccgatgac acagtggcagaactggacaggcatctggcagtgaagaccaaagaactccttggatga >gi568815589r:122141887_122361492|GENSCAN_predicted_peptide_5|200_aa MRASCPPGRAAARSLPALRPPGNDSSPGPPRGYTGSRPGRRTGPPTEGRRARVRGDLATG KRQRAGGRHVFRRAHKPSAGRTPCNMRVCAMHLSLHCTLHVPHTAALPAPTSTCCFHTWW LSGGEKVALGFTAGGQEIIVDTKPKRPLLGPEEYAHGPGIPGDHSLEEGGSSSLNHKIEI QKASVTKKVFHRSDATSDLD >gi568815589r:122141887_122361492|GENSCAN_predicted_CDS_5|603_bp atgcgcgcttcctgcccgcccggccgggccgcggcccgctcactccccgccctgcggccg ccgggaaatgacagcagcccgggaccgccgcgcgggtacacggggtcgcgccccgggaga cggacggggccgcccaccgagggccgccgggcgagggtgcgaggtgacctagcgacgggc aagcggcagcgggcaggcggccgtcatgtattcaggcgcgcgcacaagccttctgccggt cgcaccccttgtaacatgcgtgtctgtgccatgcacctaagtctccactgcacgctgcac gtgccccacacagccgcattgcctgcgccgacatccacgtgttgcttccacacgtggtgg ctgagtgggggagagaaggtggcccttggcttcacggctggtggccaggaaattatagtc gacaccaaaccaaagcggcctcttctagggccagaagaatatgcacatggccccggcatt cctggagaccacagtctggaggagggaggctcatcatcccttaaccacaagattgaaatt caaaaagcttcagtgaccaaaaaggttttccacaggtctgatgccacatctgacctggac taa