GENSCAN 1.0    Date run: 8-Nov-116    Time: 16:24:32

Sequence gi568815575r:139631657_139926825 : 295169 bp : 39.02% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 PlyA -   1219   1214    6                               1.05
 1.05 Term -  11017  10900  118  0  1   97   48   109 0.262   4.83
 1.04 Intr -  15247  15133  115  2  1  110   75    41 0.173   3.59
 1.03 Intr -  20132  20064   69  0  0   82  107    10 0.109   0.54
 1.02 Intr -  25115  25088   28  2  1   92   44    75 0.017   0.27
 1.01 Init -  36369  36307   63  0  0   77   92    55 0.213   6.00
 1.00 Prom -  39141  39102   40                              -3.65

 2.03 PlyA -  39905  39900    6                               1.05
 2.02 Term -  41577  40128 1450  2  1  -13   42   501 0.026  24.62
 2.01 Init -  43771  43626  146  1  2   88   71   122 0.255  10.14
 2.00 Prom -  47134  47095   40                              -5.85

 3.00 Prom +  49032  49071   40                              -1.15
 3.01 Sngl +  55486  55833  348  0  0   78   40   155 0.908   5.50
 3.02 PlyA +  56818  56823    6                               1.05

 4.03 PlyA -  58878  58873    6                               1.05
 4.02 Term -  60053  59815  239  0  2   46   40   168 0.814   3.15
 4.01 Init -  60526  60259  268  1  1   71  111   296 0.765  25.39
 4.00 Prom -  65222  65183   40                              -3.05

 5.00 Prom +  69547  69586   40                              -0.35
 5.01 Init +  85302  85478  177  2  0   64   36   146 0.429   6.31
 5.02 Intr +  88497  88662  166  2  1   67   44    77 0.219  -0.19
 5.03 Intr +  99991 100176  186  2  0   40   93    76 0.029   2.04
 5.04 Intr + 111579 111647   69  1  0  110   16    73 0.033   0.54
 5.05 Intr + 114084 114104   21  1  0  120  106    27 0.083   4.30
 5.06 Intr + 115422 115475   54  1  0   66  100    50 0.077   2.03
 5.07 Term + 122125 122393  269  2  2   21   48   168 0.078   0.77
 5.08 PlyA + 122457 122462    6                               1.05

 6.00 Prom + 125429 125468   40                              -5.95
 6.01 Init + 139872 140007  136  2  1   53   57   155 0.418   9.25
 6.02 Term + 140707 141143  437  2  2   50   33   220 0.452   7.16
 6.03 PlyA + 141834 141839    6                               1.05

 7.19 PlyA - 142003 141998    6                               1.05
 7.18 Term - 143297 143030  268  1  1   53   48   294 0.697  15.88
 7.17 Intr - 151072 150891  182  1  2   55   32   235 0.108  12.34
 7.16 Intr - 151611 151508  104  1  2   69   89    43 0.984   1.47
 7.15 Intr - 153643 153570   74  0  2   75   83    93 0.993   5.63
 7.14 Intr - 156687 156536  152  0  2  115   98   136 0.984  15.34
 7.13 Intr - 157832 157671  162  2  0  101   52   103 0.958   7.25
 7.12 Intr - 164814 164617  198  0  0   94   66   168 0.439  13.83
 7.11 Intr - 165670 165520  151  0  1   94   86    15 0.374   1.14
 7.10 Intr - 170683 170580  104  1  2   75   71   150 0.217  10.05
 7.09 Intr - 172943 172815  129  2  0   86   87    69 0.989   6.57
 7.08 Intr - 183329 183222  108  2  0   88   63   101 0.978   7.16
 7.07 Intr - 185287 185207   81  1  0   80   99    71 0.987   6.32
 7.06 Intr - 195167 195048  120  2  0   77  115   117 0.999  13.07
 7.05 Intr - 219684 219517  168  0  0   67   62    68 0.088   1.32
 7.04 Intr - 222095 221954  142  1  1   62   17   111 0.097   0.83
 7.03 Intr - 224267 224116  152  2  2   58   65   111 0.276   3.84
 7.02 Intr - 226106 225971  136  1  1   80   45    70 0.066   1.55
 7.01 Init - 234789 234746   44  0  2  100   42    64 0.079   1.08
 7.00 Prom - 253084 253045   40                              -4.25

 8.00 Prom + 262275 262314   40                              -5.15
 8.01 Init + 292471 292619  149  0  2   66   98   145 0.923  12.91
 8.02 Term + 293194 293323  130  1  1   38   55    77 0.713  -3.93
 8.03 PlyA + 293465 293470    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  41417  40128 1290  2  0   70   42   430 0.914  32.32
S.002 Init - 144123 144002  122  0  2   59  100    58 0.886   3.84
S.003 Term - 151072 150887  186  1  0   55   36   235 0.892  11.51

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575r:139631657_139926825|GENSCAN_predicted_peptide_1|130_aa
MGLLQLGLRHKLKMGYHYVAQEKIRVEIREETDLLLRIKDISHFLMQDIAFLSGGRGKDN
AWIITFPENCNFRCIPEEVIAKVLTYLTSIASIAATVLMEAATAALFDRDFWLPFLNTTQ
EAARISSTLR

>gi568815575r:139631657_139926825|GENSCAN_predicted_CDS_1|393_bp
atggggctcctacagctggggctgaggcacaagctaaaaatgggctaccactatgttgct
caggaaaaaatccgtgtggaaattcgagaggaaaccgatctgctcctccgtataaaggac
atcagtcatttcttaatgcaagacatcgccttcttgtctggtggccggggaaaggacaat
gcttggatcattacgtttccagaaaactgtaattttagatgtataccagaggaagtaata
gcaaaagtacttacttacctgacatctattgcaagcattgctgccactgtgctaatggaa
gcagccacggcagctttgtttgatagagatttttggctgccgtttttaaatactacccaa
gaagcagctcgtatttcatcaacgttgcgttga

>gi568815575r:139631657_139926825|GENSCAN_predicted_peptide_2|531_aa
MGRNQSRNAENSKNQRAFSPPKDCISSPAMEQSWTENDFDELTEVGFRRLLARLIKKKRE
KNQIDAIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEL
ESLNRPITGSEIEAIISSLRTKKSPGPDGFTAEFYQRYKEELVLFLLKLFQSIEKEGILP
NSFYEASIILIPKPGRDTTKKENFRLISLMNIDVKILNKILANRIQQHIKKLVHHDQVGF
IPGMPDWFNICKSINIIHHINRTNGKNHMIISIDAEKAFEKIQQCFMLKTLNKLGIDGMY
LKIIRAIFDKPTANVILNGQKLEAFPLKTGTTQGCPLSPLLFKIVLEVLARAIRQEKEIK
RIQLGKEEAKLSLFADDMIVYLENPIVSAQNLLQLRSNFSKVLGYKINMQKSQAFLYTKN
KQTESQIMSELPFTIATKRIKYLGIQLTRDVKSLFKENYKPLLSKIKEDTNKWKNIPSSW
IGRISIAKMAILPKVIYTFNAIPIKLPMTFFTELEKTTLKFIWNQKRASIA

>gi568815575r:139631657_139926825|GENSCAN_predicted_CDS_2|1596_bp
atggggagaaaccagagcagaaatgctgaaaattctaaaaaccagagagccttttctcct
ccaaaggattgcatctcctcgccagcaatggaacaaagctggacagagaatgactttgat
gagctgacagaagtaggcttcagaagactgctagcaagactaataaagaagaaaagagag
aagaatcaaatagacgcaataaaaaatgataagggggatatcaccaccgatcccacagaa
atacaaactaccattagagaatactacaagcacctctacgcaaataaactagaaaatcta
gaagaaatggataaattcctcgacacatacaccctcccaagactaaaccaggaagaactt
gaatctctgaatagaccaataacaggctctgaaattgaggcaataattagtagcctacga
acaaaaaagagtccaggaccagatggattcacagccgaattctaccagagatacaaggag
gagctggtactattccttctgaaactattccaatcaatagaaaaagagggaatcctccct
aactcattttatgaggccagcatcatcctgataccaaagcctggcagagacacaacaaaa
aaagagaattttaggctaatatccctgatgaatatcgatgtgaaaatcctcaataaaata
ctggcaaaccgaatccagcagcacatcaaaaagcttgtccaccatgatcaagttggcttc
atccctgggatgccagactggttcaacatatgcaaatcaataaacataatccatcacata
aacaggaccaacggcaaaaaccatatgattatctcaatagatgcagaaaaggcctttgag
aaaattcaacagtgcttcatgctaaaaactctcaataaactaggtattgatggaatgtat
ctcaaaataataagagctatttttgataaacccacagccaatgtcatactgaatgggcaa
aaactggaagcatttcctttgaaaactggcacaacacaggggtgccccctctcaccactc
ctattcaagatagtgttggaagttctggccagggcaatcaggcaagagaaagaaataaag
cgtattcagttaggaaaagaagaagccaaattgtccctgtttgcagatgacatgattgta
tatttagaaaacccaattgtctcagcccaaaatctccttcagctgagaagcaacttcagc
aaagtcttaggatacaaaatcaatatgcaaaaatcacaagcattcctatacactaagaac
aaacaaacagagagccaaatcatgagtgaactcccattcacaattgctacaaagagaata
aaatacctaggaatccaacttacaagggatgtgaagtccctcttcaaggagaactacaaa
ccactgctcagcaaaataaaagaggacacaaacaaatggaagaacattccaagctcatgg
ataggaagaatcagtatcgcgaaaatggccatactgcccaaggtaatttatacattcaat
gccatccccatcaagctaccaatgactttcttcacagaattggaaaaaactactttaaag
ttcatatggaaccaaaaaagagcctccattgcctag

>gi568815575r:139631657_139926825|GENSCAN_predicted_peptide_3|115_aa
MLLSQGLSTDCPLFLKHSPPVTSGSSLPTSFRILLNGIFPVLEVTRSHSDSYICHKHTMR
GSLGEKINVTSVTLASGQELTVSGTRRRSNRKRHWPCQPSDRRHLSSWDWPLLLP

>gi568815575r:139631657_139926825|GENSCAN_predicted_CDS_3|348_bp
atgctcttgtctcaaggcctttccactgactgccccctcttcctgaaacactctccccca
gtcacctctggtagttcacttcctacctccttcaggattttgctcaatggcatcttccca
gtgctggaagtaacaagaagtcactcagattcttacatctgccataaacatacaatgagg
ggaagtttgggtgaaaaaataaatgtaacttctgtgacactggcctctggccaggagcta
actgtctctggaacaagacgtcgaagcaacagaaagaggcactggccttgtcagccatca
gatagaaggcatctctcctcatgggattggcctctgctcctgccctag

>gi568815575r:139631657_139926825|GENSCAN_predicted_peptide_4|168_aa
MAAAGARPYLGSPRRAARHMPQAAASAPWTSVSCGRIPGLPQNPDGCTFMPIAQAIPLQV
DMAMALQRVRIAEKYGKNYLCLLQAVFVAVALPATNSLGHPFSVPLASCRAAGVQLWSQL
AGGEIRSSGPAGGPWESAGSSMKLSLPLLVQVSYPLLTHFGASPEKWE

>gi568815575r:139631657_139926825|GENSCAN_predicted_CDS_4|507_bp
atggcagcggcaggggcccggccctacctgggttcgccaaggcgggcggccaggcacatg
ccccaagctgccgcttcggcgccgtggacctcagtgtcctgcggcaggatcccaggcctg
ccccagaacccagatggctgcactttcatgccgatagcccaggcgatccctctgcaggtg
gacatggccatggccctgcagcgcgtgaggattgccgagaagtacgggaagaactacctc
tgcctcctgcaagcagtcttcgtggcagtggcgctcccagccacgaactcgctgggtcat
ccgttctccgttcctctggccagctgccgcgctgcaggcgtgcagctctggtctcagctc
gcaggaggtgagatcaggagctctggtcccgcaggaggaccgtgggaaagtgcaggaagc
agtatgaaactgtctctgccattgctggttcaggtctcctatccccttctcacccacttt
ggtgcgtccccggagaagtgggaatag

>gi568815575r:139631657_139926825|GENSCAN_predicted_peptide_5|313_aa
MVKGGISVIRQPWNTFTAALGPQISHCGRQRRNRRDLQMPYGESCVHELLRERAKEMTLC
LFFLPEEKGKLEGWSAASWEPETAWVVRLWSMEAKAKRLAVSRGVSTKGEDKQQGTCYNT
LDSSENVRKRRGLTEGLADNESDALLQLRFLCNSEAKILKAFENWLTYLVIIYLKKACLY
ATYHNALKTAIKKKQIFDIRMQSEKEKKWQNEVAASQKQALPIATWVTEQEKKKEKKGKG
KGKKRRRKGNQRKKLERGQINNPTSQLEELQKQEQTNPKARRKQEITQIRAELRETEMQK
NHTKDQEIQEMVI

>gi568815575r:139631657_139926825|GENSCAN_predicted_CDS_5|942_bp
atggtgaaaggtggtattagtgtaatcaggcagccatggaacaccttcacagcagctttg
gggccacagatatctcactgtgggaggcagagaagaaacagaagagaccttcagatgcca
tatggagagagctgtgtccatgagctcttacgagaacgtgcaaaagagatgactttgtgt
ctgttcttccttcccgaagagaaaggcaagctggagggctggtctgcagcatcctgggag
cctgagacagcctgggttgttaggctgtggagcatggaggcaaaggcaaaaagattagca
gtgtcccgtggagtctccacaaaaggggaagataaacagcaaggtacctgttacaataca
ttagattcgtctgagaatgttcgtaaaagaagaggtctgactgaaggtctggcggataat
gagtcagatgcccttctacagctcagatttctctgtaatagtgaggcaaagatacttaaa
gcatttgaaaattggttaacctacctggtgattatatatttaaaaaaagcctgtttatat
gccacttatcataacgccttaaaaacagcaataaagaagaaacagattttcgacataagg
atgcagtctgaaaaagaaaaaaaatggcagaatgaggtagctgcttcccagaaacaagct
ctccctatagccacctgggtgacagagcaagagaaaaagaaagaaaagaaagggaaaggg
aaagggaagaaaagaagaagaaaaggaaaccaaagaaagaagttagaaagaggtcaaatt
aacaacccaacatcacaactagaggaactacagaaacaagagcaaaccaaccccaaagct
agaagaaaacaagaaataacccaaatcagagctgaactgagggaaactgagatgcaaaaa
aaccatacaaaagaccaagaaatccaggagatggttatttga

>gi568815575r:139631657_139926825|GENSCAN_predicted_peptide_6|190_aa
MWESLEPPRHLLNGFDKNADSDMKSKVQAEVVSDRDEELVGNRSKVQKGNVVLESPYRVP
TGALISGAMRRGPPSSRPQNGRSTDSLHCAPGKAADTQCQPVKAARRETIPCKATEAELP
KTMGTHLLHQHDLDVRHGVKVDHFGALIIDCPAGFQTCMGPLIPLFWPISPIWNGYIYPI
PVPPLYLGSN

>gi568815575r:139631657_139926825|GENSCAN_predicted_CDS_6|573_bp
atgtgggaaagtttggaacctcctagacacttgttgaatggctttgacaaaaatgctgat
agtgatatgaaaagtaaggttcaggctgaggtggtctcagacagagatgaggaacttgtt
gggaaccggagcaaagtgcagaagggaaatgtggtgttggagtccccatacagagtccct
actggggcactgattagtggagctatgagaagagggccaccatcctccagaccccagaat
ggtagatccactgacagcttgcactgtgcacctggaaaagccgcagacactcaatgccag
cctgtgaaagcagccaggagggagactataccctgcaaggccacagaggcagagctgccc
aagaccatgggaacccacctcttgcatcagcatgacctggatgtgagacatggagtcaaa
gtagatcattttggagctttaataattgactgccctgctggatttcagacttgcatgggc
cctctaatccctttgttttggccaatttctcccatttggaacggctatatttacccaata
cctgtacccccattgtatctaggaagtaactag

>gi568815575r:139631657_139926825|GENSCAN_predicted_peptide_7|824_aa
MPGFLKLFICVIMGSKARYFYIEGCALTDGIMVSVHLDKGGEGVLTLTHVAPAAALFPYW
WSTEHWTNYDIKVYIGGQASWLTLGSLSKSPQIKWSQFTNVQSEESQEGQSKAIAATDRM
YLGHLRVTGLRIFDKKATGLSMASVLSGYVLVYTDNKVNWHHVDTTKVYGMDFVEQQLDP
HLGSLEPWLGWPWSAGLECREERLWADSLWSMPQCAGEEKRVGTRTVFVGNHPVSETEAY
IAQRFCDNRIVSSKVTVDTPTSPVTSGLPLFFVITVTAIKQGYEDCLRHRADNEVNKSTV
YIIENAKRVRKESEKIKVGDVVEVQADETFPCDLILLSSCTTDGTCYVTTASLDGESNCK
THYAVRDTIALCTAESIDTLRAAIECEQPQPDLYKSINAFLIVYLFILLTKAAVCTTLKY
VWQSTPYNDEPWYNQKTQKERETLKVLKMFTDFLSFMVLFNFIIPVSMYVTVEMQKFLGS
FFISWDKDFYDEEINEGALVNTSDLNEELGQVDYVFTDKTGTLTENSMEFIECCIDGHKY
KGVTQEVDGLSQTDGTLTYFDKVDKNREELFLRALCLCHTVEIKTNDAVDGATESAELTY
ISSSPDEIALVKGAKRYELLHTLNFDAVRRRMSVIVKTQEGDILLFCKGADSAVFPRVQN
HEIELTKVHVERNAMDGYRTLCVAFKEIAPDDYERINRQLIEAKMALQDREEKMEKVFDD
IETNMNLIGATAVEDKLQDQAAETIEALHAAGLKVWVLTGDKMETAKSTCYACRLFQTNT
ELLELTTKTIEESERKEDRLHELLIEYRKKLLHEFPKSTRSFKK
>gi568815575r:139631657_139926825|GENSCAN_predicted_CDS_7|2475_bp
atgcctggcttcctgaagctttttatctgtgtgatcatgggcagcaaggcaaggtacttc
tatatagaagggtgcgcccttacagatggaataatggtgagcgtacacttggacaaggga
ggggaaggggttcttaccctgacgcacgtggcccctgctgctgcgttgttcccctattgg
tggtccacagaacactggaccaactacgatataaaggtctacatcggggggcaagcctcc
tggttgacactggggtctttatcgaaatctccccagattaaatggtcccaatttactaat
gtccagtctgaggagagtcaggagggacagagcaaagccattgctgctacagatagaatg
tatttaggccatctgcgggttactgggttaaggatttttgataagaaggctacggggttg
tccatggcctcagtgctttcgggctacgtccttgtttacactgacaacaaagtgaattgg
caccatgtagacaccaccaaggtttatggcatggactttgtggagcagcagctcgatcct
cacctgggctcccttgagccatggctggggtggccatggagtgctgggctggaatgcagg
gaggagagactctgggcagatagcctgtggagtatgccccagtgtgctggagaagagaaa
cgagttggcacacgcacagtgtttgttggcaatcatccagtttcggaaacagaagcttac
attgcacaaagattttgtgataatagaatagtctcatctaaggtcacagtagacacacca
actagcccagttaccagtggacttccacttttctttgttataactgttacagccatcaag
cagggatatgaggattgtctgagacacagagctgacaatgaagtcaacaaaagcactgtt
tacattattgaaaatgcaaagcgagtgagaaaagaaagtgaaaaaatcaaggttggtgat
gtagtagaagtacaggcagatgaaacctttccctgtgatcttattcttctatcatcttgc
accactgatggaacctgttatgtcactacagccagtcttgatggggaatccaattgcaag
acacattatgcagtacgtgataccattgcactgtgtacagcagaatccatcgataccctc
cgagcagcaattgaatgtgaacagcctcaacctgacctctacaaatctattaatgctttc
ctgattgtatatttatttatcttactgaccaaagctgcagtatgcactactctaaagtat
gtttggcaaagtaccccatacaatgatgaaccttggtataaccaaaagactcagaaagag
cgagagaccttgaaggttttaaaaatgttcaccgacttcctatcatttatggttctattc
aactttatcattcctgtctccatgtacgtcacagtagaaatgcagaaattcttgggctcc
ttcttcatctcatgggataaggacttttatgatgaagaaattaatgaaggagccctggtt
aacacatcagaccttaatgaagaacttggtcaggtggattatgtatttacagataagact
ggaacactcactgaaaacagcatggaattcattgaatgctgcatagatggccacaaatat
aaaggtgtaactcaagaggttgatggattatctcaaactgatggaactttaacatatttt
gacaaagtagataagaatcgagaagagctgtttctacgtgccttgtgtttatgtcatact
gtagaaatcaaaacaaacgatgctgttgatggagctacagaatcagctgaattaacctat
atctcctcttcaccagatgaaatagctttggtgaaaggagctaaaagatatgaacttctt
cacaccttaaactttgatgctgtccggcgacgtatgagtgtaattgtgaagactcaagaa
ggagacatacttctcttttgtaaaggagcagactcggcagtttttcccagagtgcaaaat
catgaaattgagttaactaaagtccatgtggaacgtaatgcaatggatgggtatcggaca
ctctgtgtagccttcaaagaaattgctccagatgattatgaaagaattaacagacagctc
atagaggcaaaaatggccttacaagacagagaagaaaaaatggaaaaagttttcgatgat
attgagacaaacatgaatttaattggagccactgcagttgaagacaagctacaagatcaa
gctgcagagaccattgaagctctgcatgcagcaggcctgaaagtctgggtgctcactggg
gacaagatggagacagctaaatccacatgctatgcctgccgccttttccagaccaacact
gagctcttagaactaaccacaaaaaccattgaagaaagtgaaaggaaagaagatcgatta
catgaattattgatagaatatcgcaagaaattgctgcatgagtttcctaaaagtactaga
agctttaaaaagtaa

>gi568815575r:139631657_139926825|GENSCAN_predicted_peptide_8|92_aa
MKIHELGDAPERKPASVDAKLAETSILPGSPTGCINLPPAGCSSASSDKQLGPNEPFPVT
DFALYPFAIINHIRKRFYVLSSASESTNLGRF

>gi568815575r:139631657_139926825|GENSCAN_predicted_CDS_8|279_bp
atgaaaatccacgaacttggcgatgctccagagaggaaaccagcaagtgttgacgctaaa
ctggcagaaacatcaatacttcctggctcccccactgggtgcatcaatttaccaccggca
ggctgctcttctgcttcaagtgacaaacaacttggccccaatgagccttttcccgttact
gactttgctctgtatccttttgctataataaatcacatccgcaagcgtttctatgtactg
agttctgccagtgaatcaacaaacctaggccggttttga