GENSCAN 1.0    Date run: 4-Nov-116    Time: 03:37:53

Sequence gi568815595f:10716266_11038399 : 322134 bp : 47.52% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    613    669   57  0  0   99   46    28 0.464   1.01
 1.02 Term +    695    868  174  1  0   78   48   129 0.972   5.66
 1.03 PlyA +   2635   2640    6                               1.05

 2.07 PlyA -   4304   4299    6                               1.05
 2.06 Term -  24927  24850   78  0  0    0   55   272 0.973  12.86
 2.05 Intr -  26190  25864  327  0  0   60  -19   208 0.772   3.19
 2.04 Intr -  48752  48610  143  0  2   48   88    65 0.061   2.57
 2.03 Intr -  54599  54491  109  2  1   56   80    57 0.038   1.56
 2.02 Intr -  59434  59340   95  2  2   70   67    28 0.048  -1.42
 2.01 Init -  64442  64262  181  2  1   57   96   194 0.915  16.44
 2.00 Prom -  72959  72920   40                              -3.86

 3.00 Prom +  85014  85053   40                              -3.86
 3.01 Init + 100001 100256  256  1  1  108   78   434 0.957  41.79
 3.02 Intr + 103200 103334  135  1  0  105   95    29 0.973   5.84
 3.03 Intr + 103447 103587  141  2  0  123   80   117 0.998  14.72
 3.04 Intr + 107037 107127   91  1  1   81   83    80 0.666   5.85
 3.05 Intr + 117960 118066  107  0  2   96  100    30 0.390   4.86
 3.06 Intr + 118779 118885  107  1  2   93   80    -9 0.444  -1.27
 3.07 Intr + 119342 119518  177  1  0   71   36   104 0.412   3.62
 3.08 Intr + 124216 124277   62  0  2   95   88    11 0.766  -0.67
 3.09 Intr + 127949 128081  133  2  1  133  116   132 0.967  21.15
 3.10 Intr + 135346 135399   54  0  0   90   86    13 0.254   0.48
 3.11 Intr + 158696 158830  135  1  0  114  110   152 0.997  20.76
 3.12 Intr + 179629 179740  112  0  1   51   94    57 0.001   2.55
 3.13 Intr + 186838 186929   92  2  2   70   57    77 0.003   2.51
 3.14 Intr + 195825 195928  104  2  2  118   99    52 0.992   8.27
 3.15 Intr + 202064 202188  125  2  2  123  111   120 0.819  18.03
 3.16 Intr + 209739 209851  113  1  2   99  110   146 0.175  18.00
 3.17 Intr + 212937 213074  138  2  0   62  105   151 0.975  14.86
 3.18 Intr + 215385 215438   54  2  0   85   85    26 0.657   1.18
 3.19 Intr + 216886 216988  103  0  1   84  101   190 0.995  19.65
 3.20 Intr + 217801 217901  101  2  2   65   91   177 0.999  15.53
 3.21 Intr + 218764 218934  171  0  0  100   78   255 0.721  25.84
 3.22 Intr + 221005 221116  112  0  1   66  109    16 0.099   1.45
 3.23 Intr + 224918 225003   86  0  2  135   89   -17 0.005   2.64
 3.24 Intr + 227387 227530  144  1  0   85   89    67 0.177   6.98
 3.25 Intr + 232398 232515  118  2  1   57   70    38 0.028  -1.06
 3.26 Intr + 238238 238288   51  0  0   72   72    50 0.077   0.68
 3.27 Intr + 240679 240732   54  2  0  115   91    59 0.971   7.85
 3.28 Term + 241772 241881  110  0  2   44   43   139 0.774   3.67
 3.29 PlyA + 242835 242840    6                               1.05

 4.00 Prom + 252781 252820   40                              -7.56
 4.01 Init + 255075 255124   50  2  2   48   61   130 0.857   6.72
 4.02 Intr + 261098 261146   49  2  1   97   65    32 0.103   0.48
 4.03 Intr + 269913 269970   58  2  1   84   63    70 0.038   2.56
 4.04 Intr + 285157 285206   50  2  2   98   94    54 0.467   5.40
 4.05 Intr + 294007 294162  156  0  0  108   68    33 0.411   3.51
 4.06 Intr + 295022 295060   39  1  0   84  103    15 0.250   1.02
 4.07 Intr + 299407 299468   62  0  2  140   97    22 0.945   5.83
 4.08 Intr + 300855 300896   42  0  0  101   13    95 0.606   0.66
 4.09 Intr + 300922 301184  263  1  2   59   61   566 0.902  48.13
 4.10 Intr + 301578 301709  132  1  0   87  110   133 0.935  16.02
 4.11 Intr + 302333 302433  101  0  2   79  113   168 0.992  18.23
 4.12 Intr + 303948 304057  110  2  2   69   64   158 0.906  10.58
 4.13 Intr + 306071 306203  133  2  1   68  116   119 0.921  13.35
 4.14 Intr + 309184 309318  135  0  0   86  105   276 0.978  29.86
 4.15 Intr + 309508 309611  104  0  2   86   36    85 0.798   2.07
 4.16 Intr + 309970 310094  125  1  2  134   82   264 0.992  30.63
 4.17 Intr + 312470 312582  113  0  2   89   94   119 0.998  12.70
 4.18 Intr + 312956 313087  132  1  0   28  115   214 0.510  18.94
 4.19 Intr + 314912 315014  103  1  1  120  113    71 0.999  12.55
 4.20 Intr + 317374 317474  101  2  2   92   95    39 0.999   4.83
 4.21 Intr + 318266 318433  168  1  0  139  115   137 0.999  21.64
 4.22 Term + 320597 320701  105  1  0   87   46   146 0.916   8.71
 4.23 PlyA + 321258 321263    6                              -0.45

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 186795 186929  135  2  0   77   57    97 0.960   5.65

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:10716266_11038399|GENSCAN_predicted_peptide_1|76_aa
MVTSTYAEVTGSGTGPEDRANSRDDLENQIHPLYSHCSLLKRVSHHPSNNRTLIRILKEE
GPVGPVAPCTEHHIIQ

>gi568815595f:10716266_11038399|GENSCAN_predicted_CDS_1|231_bp
atggtaaccagcacctatgctgaggttacaggctctgggacgggcccagaggacagggcc
aacagccgggatgatttagaaaatcaaatccacccactctattcccactgctccttgctg
aaaagggtgtcacaccatccaagtaataacaggacgctcatcagaatcctcaaggaggag
ggcccagtgggccccgtggctccgtgcactgagcaccatatcatccagtga

>gi568815595f:10716266_11038399|GENSCAN_predicted_peptide_2|310_aa
MTGADIYVLEESQDDSEVEDCDVMRKRAVAFILKTDAGGLHQGGTLEDEEGWMTEPLRKP
EGRFNNHSSYFTDKEVAAQEVRFLGQGQGTERNIAPMRTDSCDPKEEGRTCSQMPGGSPP
SLASGWVSQKITPTLEGAAGLSVLVLSLLRCCDEAKVTVWTAVPSGLHPVPGAYLQAVGV
GSPTAGRAVRERMKAEGSQCPQPTIGQASHIQAPVSGQLSFSKETGEGGGKRRGKWEEES
EKKEREEEEQREEEEEGEEEGNITASAFLELNFKFYARLILGGNYIDDDDDDDDDDDDDD
YDDNDDDDVF

>gi568815595f:10716266_11038399|GENSCAN_predicted_CDS_2|933_bp
atgactggggcagatatttatgttctggaagaatcacaggatgacagtgaagtggaagac
tgtgacgtgatgaggaaaagggctgttgctttcatcctcaagacagatgcgggtggccta
catcagggagggaccttggaggacgaggagggctggatgacagaaccactccgcaagcca
gaaggtagattcaacaaccatagttcctattttacagataaggaagttgctgctcaagaa
gtgaggttccttggtcaaggtcagggaacggaacgtaacattgctcccatgagaacggac
agctgtgacccgaaagaggaaggccggacatgcagccagatgcccgggggctcgcctccc
agcctggcaagtggctgggtgtctcagaagataactccaacactcgagggggcagccggg
cttagtgtccttgtcctgagcctgctgaggtgctgtgatgaggccaaggtcaccgtctgg
acagcagtgccatcaggactccacccagtccctggggcgtatctgcaggcagttggtgta
ggttccccaaccgctggcagagcggtgagagaaaggatgaaggctgaaggctcccagtgc
ccacagccaacaatcgggcaggcttcccacatccaggctccagtctcagggcaactcagt
tttagtaaggagactggtgaagggggagggaagaggagaggaaaatgggaggaggagagt
gagaagaaggagagggaggaagaggagcagcgggaggaggaggaggaaggggaagaggag
ggaaacatcacagcttcagctttcctggaactaaatttcaaattctatgcacgactcatc
cttgggggaaactatattgatgatgatgatgatgatgatgatgatgatgatgatgatgat
tatgatgataatgatgacgatgatgttttttga

>gi568815595f:10716266_11038399|GENSCAN_predicted_peptide_3|1061_aa
MTAEKALPLGNGKAAEEARESEAPGGGCSSGGAAPARHPRVKRDKAVHERGHWNNKVEFV
LSVAGEIIGLGNVWRFPYLCYKNGGGAFLIPYVVFFICCGIPVFFLETALGQFTSEGGIT
CWRKVCPLFEGIGYATQVIEAHLNVYYIIILAWAIFYLSNCFTTELPWATCGHEWNTENC
VEFQKLNVSNYSHVSLQNATSPVMEFWELLPVCEMWSAACFIPGAESDCCVLPLCSPVDS
GSGRLLLMLSRLSWPPPIRDTQWLLAILNADHLGGARAQPQRGSAGIKLGIKRQVQDQAL
PPTGFVISDKVAISVEPEFPLNNGVQDNYTKAPDTSSQTTGSLKTPRLSSVSGTLMTLSH
RVLAISDGIEHIGNLRWELALCLLAAWTICYFCIWKGTKSTGKALLQGLNSFSAHNNPMQ
KVVYVTATFPYIMLLILLIRGVTLPGASEGIKFYLYPDLSRLSDPQEQASSLHVQVMLMW
AAFVASRRPMMRGCLLLSPNAAQGWVNAAFVKDLRHYHSKHKPIRGHRDGFNWSVWVDAG
TQIFFSYAICLGCLTALGSYNNYNNNCYRDCIMLCCLNSGTSFVAGFAIFSVLGFMAYEQ
GVPIAEVAESGPGLAFIAYPKAVTMMPLSPLWATLFFMMLIFLGLDSQFVCVESLVTAVV
DMYPKVFRRGYRRELLILALSVISYFLGLVMLTEKICRSGLTAAFPSNCPPEGGMYIFQL
FDSYAASGMCLLFVAIFECICIGWVYGSNRFYDNIEDMIGYRPPSLIKWCWMIMTPGICA
GIFIFFLIKYKPLKYNNIYTYPAWGYGIGWLMALSSMLCIPLWICITVWKTEGTLPESLD
TNPGTWLWNACSSPRSMPEPGTVPGTEEAPINRTDRGFCGTEKSSNLPTVTQLVNIRTRK
RTQDRSLDEQVPSLPQSGEPWEKEYIGIEDLLSAGDHTRHFPMHYLALNCLGQLHDNQAS
FSRERKEAPGSAKAGPLKKFIAQRGLWHLKGSSHPTMDAAGLGAVDVGEIEYNTDPSIRC
KESLQGLCEVTCIVHRCLKQSLGRTSCHQCKPFTVIIENDV

>gi568815595f:10716266_11038399|GENSCAN_predicted_CDS_3|3186_bp
atgacggcggagaaggcgctgcccctgggcaatgggaaggctgctgaggaggcgcgggag
tccgaggcgccgggtggcggctgcagcagcgggggcgcggcgcccgcgcgccacccgcgc
gtcaagcgcgacaaggcggtccacgagcgcggccactggaacaacaaggtggagttcgtg
ctgagcgtggccggggagatcattgggctgggcaacgtgtggcgcttcccctacctgtgc
tacaagaacggaggaggggcattcctgattccctacgtggtgttttttatttgctgtgga
attcctgtttttttcctggagacagctctggggcagttcacaagtgaaggtggcattacg
tgttggaggaaagtttgccctttatttgaaggcattggctatgcaacacaggtgattgag
gcccatctgaatgtgtactacatcatcatcctggcatgggccattttttacctgagcaac
tgcttcactactgagctaccctgggctacctgtgggcatgagtggaacacagagaattgt
gtggagttccagaaactgaatgtgagcaactacagccatgtgtctctgcagaatgccacc
tcccctgtcatggagttttgggagcttcttcctgtgtgtgaaatgtggtcagctgcctgc
tttatcccaggtgcagagtcggactgctgtgtgctgcctctctgcagccctgtggactca
ggctctggaaggctgctcctgatgctctcgcgcctatcctggccaccacccatccgcgac
acacagtggcttcttgcaatccttaatgcagaccacctcggaggggccagggcacagcca
cagaggggcagcgcaggcattaaacttgggatcaagagacaggttcaagaccaggctttg
ccacctaccggttttgtgatctcggacaaggtagctatttctgtggagcctgagtttcct
ctgaacaatggtgtacaagataactacacaaaagcgcctgacacgtcttcccaaaccacg
ggctctttaaagacccccaggcttagctctgtgtctggcacactgatgacgctttcgcac
cgggtcctggccatctctgacgggatcgagcacatcgggaaccttcgctgggagctggcc
ttgtgtctcttggcagcctggaccatctgttacttctgtatctggaaggggaccaagtct
acaggaaaggccctgctccaaggccttaactcattcagtgctcacaacaaccctatgcag
aaggttgtatacgtgactgcgacattcccctacatcatgctgctgatcctcctgatacga
ggggtcacgttgcccggggcctcagagggcatcaagttctacttgtaccctgacctctcc
cggctctccgacccccaggagcaagcgtcgtctttgcatgtgcaggtgatgctcatgtgg
gcagcttttgtggcttctcgaaggcccatgatgagaggatgtctcctcctttctcctaat
gctgcccaaggatgggttaatgctgcatttgtcaaagacctaaggcattaccacagcaaa
cacaagccaattcgaggacatagagatggatttaattggtcggtctgggtagatgctgga
acgcagatctttttctcctatgccatttgcctgggctgtctgaccgctctgggaagttat
aacaattataacaacaactgctacagggactgcatcatgctctgttgcctgaacagcggc
accagcttcgtggctgggtttgccatcttctcagtcctgggttttatggcgtacgagcag
ggggtacccattgctgaggtggcagagtcaggccccggcctggcctttattgcgtacccc
aaggcggtcaccatgatgcctctctccccgctgtgggccaccttgttcttcatgatgctc
atcttcctgggcctggacagccagtttgtgtgtgtggaaagcctggtgaccgccgtggtg
gacatgtaccccaaggttttccggaggggttaccggcgggagctgctcatcctagccttg
tctgttatctcctattttctgggcctcgtgatgttaacagagaaaatctgcagatctggc
ctcacagctgcattccccagcaactgccctccagagggtggcatgtacatcttccagctc
tttgactcctatgccgccagtgggatgtgccttctcttcgtggccatctttgagtgcatc
tgcatcggctgggtgtatggaagcaaccggttctatgataacattgaagacatgattggc
taccggccaccgtcgctcattaagtggtgctggatgatcatgacccctgggatctgcgcg
gggatcttcatcttcttcttgatcaagtacaagccactcaagtacaacaacatctacacc
tacccagcctggggctatggcattggctggctcatggccctgtcctccatgctctgcatc
ccgctctggatctgcatcacagtgtggaagacggaggggacactgcccgagagcctggac
acaaacccaggcacctggctttggaacgcgtgctctagccccagatccatgccagagcct
ggcacggtgcctggcacagaggaggcacctattaatcgcacagataggggattttgcggc
acagaaaagtcaagcaatttgcccacggtaacacagctagtaaacatcaggaccagaaag
agaactcaggacaggagcttggatgaacaagtgcccagcttgccccagagtggggagcca
tgggaaaaggaatacataggcattgaagacctactcagtgctggagaccataccagacac
tttcctatgcattaccttgccctcaactgccttggccagctgcatgacaaccaggcatcg
ttcagcagggagaggaaggaggccccagggtcggcaaaggctgggcctttgaagaagttc
atagctcagcgtggtctctggcatctcaaagggtcttcccaccccaccatggatgcagct
ggtttaggagcagttgacgtgggggagattgaatataacactgatccctccatccgttgc
aaggaatctttacaaggcttatgtgaggttacgtgcatcgtgcatcgatgtttgaaacag
agcctcggcagaacaagctgtcaccagtgcaagccgttcacggtgataatcgagaacgat
gtgtga

>gi568815595f:10716266_11038399|GENSCAN_predicted_peptide_4|776_aa
MEEEEIKERFLEEEASNPAQEFLAFHGAKRTQETMKKKFEEISAFPEQRPQETLSEWDTS
LHFVEVQSQVLSGAQDSAFLTGSQVMLMLSLEEPHFEKQGPESLGNLRPQAGKAVSLSVR
WPFLGSQRETLKLTDLTQGGRRRPSGADPPPTTIRVPLSRKTAVHVDRGTPASKFRDMAT
NGSKVADGQISTEVSEAPVANDKPKTLVVKVQKKAADLPDRDTWKGRFDFLMSCVGYAIG
LGNVWRFPYLCGKNGGGAFLIPYFLTLIFAGVPLFLLECSLGQYTSIGGLGVWKLAPMFK
GVGLAAAVLSFWLNIYYIVIISWAIYYLYNSFTTTLPWKQCDNPWNTDRCFSNYSMVNTT
NMTSAVVEFWERNMHQMTDGLDKPGQIRWPLAITLAIAWILVYFCIWKGVGWTGKVVYFS
ATYPYIMLIILFFRGVTLPGAKEGILFYITPNFRKLSDSEVWLDAATQIFFSYGLGLGSL
IALGSYNSFHNNVYRDSIIVCCINSCTSMFAGFVIFSIVGFMAHVTKRSIADVAASGPGL
AFLAYPEAVTQLPISPLWAILFFSMLLMLGIDSQFCTVEGFITALVDEYPRLLRNRRELF
IAAVCIISYLIGLSNITQGGIYVFKLFDYYSASGMSLLFLVFFECVSISWFYGVNRFYDN
IQEMVGSRPCIWWKLCWSFFTPIIVAGVFIFSAVQMTPLTMGNYVFPKWGQGVGWLMALS
SMVLIPGYMAYMFLTLKGSLKQRIQVMVQPSEDIVRPENGPEQPQAGSSTSKEAYI

>gi568815595f:10716266_11038399|GENSCAN_predicted_CDS_4|2331_bp
atggaagaagaagagatcaaggaacgcttcctggaagaagaagcatcaaacccagcccag
gagtttctggccttccacggtgcaaagagaactcaggagaccatgaaaaagaagtttgaa
gaaatcagtgcctttcccgaacagaggccccaagagaccttgtcagaatgggatacttcc
ctgcacttcgtggaggtgcagagtcaggtcttgagtggggcccaagattctgcatttctg
acaggctcccaggtgatgctgatgctgtctcttgaagaaccgcactttgagaagcaaggc
cctgaatcactgggcaatttgagaccccaggctggcaaggctgtttccttatctgtcaga
tggccttttctgggcagccagagggagactcttaaactaaccgatctcacccagggtggc
agaaggaggccttctggagctgacccacccccgacgaccatcagggtgcccttgagccgc
aaaactgctgtccacgtggaccgggggacccctgcgtccaaattccgagacatggcgacc
aacggcagcaaggtggccgacgggcagatctccaccgaggtcagcgaggcccctgtggcc
aatgacaagcccaaaaccttggtggtcaaggtgcagaagaaggcggcagacctccccgac
cgggacacgtggaagggccgcttcgacttcctcatgtcctgtgtgggctatgccatcggc
ctgggcaacgtctggaggttcccctatctctgcgggaaaaatggtgggggagccttcctg
atcccctatttcctgacactcatctttgcgggggtcccactcttcctgctggagtgctcc
ctgggccagtacacctccatcggggggctaggggtatggaagctggctcctatgttcaag
ggcgtgggccttgcggctgctgtgctatcattctggctgaacatctactacatcgtcatc
atctcctgggccatttactacctgtacaactccttcaccacgacactgccgtggaaacag
tgcgacaacccctggaacacagaccgctgcttctccaactacagcatggtcaacactacc
aacatgaccagcgctgtggtggagttctgggagcgcaacatgcatcagatgacggacggg
ctggataagccaggtcagatccgctggccactggccatcacgctggccatcgcctggatc
cttgtgtatttctgtatctggaagggtgttggctggactggaaaggtggtctacttttca
gccacatacccctacatcatgctgatcatcctgttcttccgtggagtgacgctgcccggg
gccaaggagggcatcctcttctacatcacacccaacttccgcaagctgtctgactccgag
gtgtggctggatgcggcaacccagatcttcttctcatacgggctgggcctggggtccctg
atcgctctcgggagctacaactctttccacaacaatgtctacagggactccatcatcgtc
tgctgcatcaattcgtgcaccagcatgttcgcaggattcgtcatcttctccatcgtgggc
ttcatggcccatgtcaccaagaggtccattgctgatgtggcggcctcaggccccgggctg
gcgttcctggcatacccagaggcggtgacccagctgcctatctccccactctgggccatc
ctcttcttctccatgctgttgatgctgggcattgacagccagttctgcactgtggagggc
ttcatcacagccctggtggatgagtaccccaggctcctccgcaaccgcagagagctcttc
attgctgctgtctgcatcatctcctacctgatcggtctctctaacatcactcaggggggt
atttatgtcttcaaactctttgactactactctgccagtggcatgagcctgctgttcctc
gtgttctttgaatgtgtctctatttcctggttttacggtgtcaaccgattctatgacaat
atccaagagatggttggatccaggccctgcatctggtggaaactctgctggtctttcttc
acaccaatcattgtggcgggcgtgttcattttcagtgctgtgcagatgacgccactcacc
atgggaaactatgttttccccaagtggggccagggtgtgggctggctgatggctctgtct
tccatggtcctcatccccgggtacatggcctacatgttcctcaccttaaagggctccctg
aagcagcgcatccaagtcatggtccagcccagcgaagacatcgttcgcccagagaatggt
cctgagcagccccaggcgggcagctccaccagcaaggaggcctacatctag