GENSCAN 1.0 Date run: 5-Nov-116 Time: 10:12:08

Sequence gi568815590r:42737965_42943094 : 205130 bp : 43.28% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   2608   2647   40                              -2.56
 1.01 Init +   6384   6540  157  2  1   75   79    38 0.540   1.67
 1.02 Intr +   7844   8011  168  0  0   50   68   100 0.384   4.12
 1.03 Term +  15806  15960  155  0  2   87   34    55 0.125  -1.92
 1.04 PlyA +  16219  16224    6                              -0.45

 2.07 PlyA -  17053  17048    6                               1.05
 2.06 Term -  17836  17783   54  1  0   77   54    50 0.424  -1.94
 2.05 Intr -  18860  17882  979  1  1   72   84   472 0.529  36.14
 2.04 Intr -  19073  18964  110  2  2   60   27   101 0.609   0.28
 2.03 Intr -  23870  23592  279  2  0    2   53   236 0.599   9.17
 2.02 Intr -  27240  27101  140  1  2  122   89   143 0.951  17.98
 2.01 Init -  30466  30388   79  1  1   72  100    43 0.918   3.12
 2.00 Prom -  33042  33003   40                              -4.86

 3.07 PlyA -  34865  34860    6                               1.05
 3.06 Term -  34895  34875   21  2  0  104   49     4 0.008  -3.59
 3.05 Intr -  49887  49701  187  2  1   62   55   110 0.010   4.79
 3.04 Intr -  76537  76457   81  0  0  100   89    40 0.417   4.05
 3.03 Intr -  76774  76691   84  0  0   92   33   109 0.389   4.74
 3.02 Intr -  78772  78705   68  1  2   45   97    51 0.365  -0.60
 3.01 Init -  85348  85274   75  1  0   52  108    47 0.477   4.19
 3.00 Prom -  91928  91889   40                              -5.36

 4.04 PlyA -  92851  92846    6                               1.05
 4.03 Term - 100306  99998  309  1  0  121   42   275 0.515  21.26
 4.02 Intr - 101417 101222  196  1  1   69   85     8 0.776  -1.98
 4.01 Init - 105130 105060   71  1  2   80   61   178 0.919  14.82
 4.00 Prom - 105973 105934   40                              -6.46

 5.08 PlyA - 107264 107259    6                               1.05
 5.07 Term - 109020 108920  101  1  2   53   49   131 0.261   3.99
 5.06 Intr - 132148 132040  109  1  1   68   91    90 0.432   7.16
 5.05 Intr - 136042 135967   76  0  1   54   72    38 0.171  -1.68
 5.04 Intr - 149907 149764  144  2  0  120   94    87 0.508  11.90
 5.03 Intr - 158355 158290   66  2  0   25   80   105 0.055   1.42
 5.02 Intr - 159614 158662  953  2  2  104    9   292 0.015  12.81
 5.01 Init - 181134 181051   84  0  0   66   59   111 0.161   4.82
 5.00 Prom - 196914 196875   40                              -2.46

 6.02 PlyA - 197208 197203    6                               1.05
 6.01 Sngl - 198028 197438  591  1  0   86   50   192 0.925  11.40

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590r:42737965_42943094|GENSCAN_predicted_peptide_1|159_aa
MASPVENNWYLNVFRMLYKNTIDWVASKLQKFIADSSGDWEAQGKVLAGLVCGAVFFPDP
PSSGLINLDDQDTPQSFILVIILSLKNQMIKHLRPHPAHGFEGSRMELEGRCTASVSASD
SRVCGKALTPWVTRDFPNCKKRSREATAQRARLILCNGG

>gi568815590r:42737965_42943094|GENSCAN_predicted_CDS_1|480_bp
atggcatctccagttgagaataactggtatctaaatgtattcaggatgctatataaaaat
accatagattgggtggcttctaaactgcagaaatttattgctgacagttctggagactgg
gaagcccaaggaaaggtgctggcaggtctggtgtgtggtgcggtgtttttcccagacccg
ccctcaagtggcttaataaacctcgatgatcaagacacgcctcagtccttcattttggtc
atcattctttctctgaaaaatcagatgatcaagcacctgaggccacatccagcacatggc
tttgagggctcccgaatggagctggagggcagatgcacagcctctgtgagtgcctcagac
agcagagtctgcggcaaagccctgactccgtgggtcaccagggactttcccaactgcaag
aaacgctccagagaagcaactgctcagagagctagactcattctttgcaacggtgggtag

>gi568815590r:42737965_42943094|GENSCAN_predicted_peptide_2|546_aa
MLTSKGQGFLHGGLCLWLCVFTPFFKGCVGCATEERLFHKLFSHYNQFIRPVENVSDPVT
VHFEVAITQLANVPLMQLLCQLIDFSKKEVQGDGTRNGVDAVGLWTPPAEMSDSSAMPIG
GIWNWMEREYDLPDRSCSIGSADISEGVHIPMGHVHTTWSFSEPAEIWNDYKLRWDPMEY
DGIETLRVPADKIWKPDIVLYNNAVGDFQVEGKTKALLKYNGMITWTPPAIFKSSCPMDI
TFFPFDHQNCSLKFGSWTYDKAEIDLLIIGSKVDMNDFWENSEWEIIDASGYKHDIKYNC
CEEIYTDITYSFYIRRLPMFYTINLIIPCLFISFLTVLVFYLPSDCGEKVTLCISVLLSL
TVFLLVITETIPSTSLVVPLVGEYLLFTMIFVTLSIVVTVFVLNIHYRTPTTHTMPRWVK
TVFLKLLPQVLLMRWPLDKTRGTGSDAVPRGLARRPAKGKLASHGEPRHLKECFHCHKSN
ELATSKRRLSHQPLQWVVENSEHSPEVEDVINSVQFIAENMKSHNETKEPMGTLDAQSDL
QCDFCS

>gi568815590r:42737965_42943094|GENSCAN_predicted_CDS_2|1641_bp
atgctgaccagcaaggggcagggattccttcatgggggcttgtgtctctggctgtgtgtg
ttcacacctttctttaaaggctgtgtgggctgtgcaactgaggagaggctcttccacaaa
ctgttttctcattacaaccagttcatcaggcctgtggaaaacgtttccgaccctgtcacg
gtacactttgaagtggccatcacccagctggccaacgtgcctttgatgcagctgctgtgt
caattaattgatttttccaagaaggaggtacagggtgatggcaccaggaatggagttgat
gctgtgggtctttggactcctccggcagagatgtctgatagcagtgccatgcccattggt
gggatctggaattggatggagagagagtatgacttgccagacaggtcctgctccataggg
tcagctgacatctctgaaggtgtccacatccccatgggccatgtccataccacctggagc
ttctccgagcctgcggaaatctggaatgattataaattgcgctgggatccaatggaatat
gatggcattgagactcttcgcgttcctgcagataagatttggaagcccgacattgttctc
tataacaatgctgttggtgacttccaagtagaaggcaaaacaaaagctcttcttaaatac
aatggcatgataacctggactccaccagctatttttaagagttcctgccctatggatatc
acctttttcccttttgatcatcaaaactgttccctaaaatttggttcctggacgtatgac
aaagctgaaattgatcttctaatcattggatcaaaagtggatatgaatgatttttgggaa
aacagtgaatgggaaatcattgatgcctctggctacaaacatgacatcaaatacaactgt
tgtgaagagatatacacagatataacctattctttctacattagaagattgccgatgttt
tacacgattaatctgatcatcccttgtctctttatttcatttctaaccgtgttggtcttt
taccttccttcggactgtggtgaaaaagtgacgctttgtatttcagtcctgctttctctg
actgtgtttttgctggtcatcacagaaaccatcccatccacatctctggtggtcccactg
gtgggtgagtacctgctgttcaccatgatctttgtcacactgtccatcgtggtgactgtg
tttgtgttgaacatacactaccgcaccccaaccacgcacacaatgcccaggtgggtgaag
acagttttcctgaagctgctgccccaggtcctgctgatgaggtggcctctggacaagaca
aggggcacaggctctgatgcagtgcccagaggccttgccaggaggcctgccaaaggcaag
cttgcaagccatggggaacccagacatcttaaagaatgcttccattgtcacaaatcaaat
gagcttgccacaagcaagagaagattaagtcatcagccattacagtgggtggtggaaaat
tcggagcactcgcctgaagttgaagatgtgattaacagtgttcagttcatagcagaaaac
atgaagagccacaatgaaaccaaggagcctatgggcactctggatgcacagtcagacctt
cagtgtgacttttgttcctga

>gi568815590r:42737965_42943094|GENSCAN_predicted_peptide_3|171_aa
MHRRTSLSNGSSINMIGPKHVLKAQRDLDKCFRLEEAHDTSPWVKICKAVCSGGITPSQR
ECELKCTGSLDRAGAELFALQVWRLPASDSYFGTGAHGLTSTCGVRGLAGSGVKLLTLTV
SVTALKAGRLELFILPGGFVVSLASGVKLQTFVVSVTAHKGSVDPKTSKLK

>gi568815590r:42737965_42943094|GENSCAN_predicted_CDS_3|516_bp
atgcatagaagaacttcattatcaaatggaagtagtataaacatgattgggcccaagcac
gtcctaaaggcacaaagagatctagacaaatgcttccgattagaagaggctcatgatact
tctccgtgggtcaagatttgtaaagcggtgtgctcgggcggcatcacacccagccagcgc
gagtgtgaactcaagtgcacgggcagccttgaccgggctggagctgagctgtttgcactg
caggtttggcgcctgcctgcatctgattcctactttgggacaggagcccatgggctgacg
agcacgtgtggggttcgtggtctcgctggctcaggagtgaagctgctgaccctcactgtg
agtgttacagctcttaaggcggggcgtctggagttgttcattcttcctggtgggttcgtg
gtctcactggcttcaggagtgaagctacagaccttcgtggtgagtgttacagctcataaa
ggcagtgtggacccaaaaacctccaaattgaaatga

>gi568815590r:42737965_42943094|GENSCAN_predicted_peptide_4|191_aa
MVQSCSAYGCKNRYDKDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFTP
DCFKRECNNKLLKENAVPTIFLCTEPHDKVDAAIGLLMPPLQTPVNLSVFCDHNYTVEDT
MHQRKRIHQLEQQVEKLRKKLKTAQQRCRRQERQLEKLKEVVHFQKEKDDVSERGYVILP
NDYFEIVEVPA

>gi568815590r:42737965_42943094|GENSCAN_predicted_CDS_4|576_bp
atggtgcagtcctgctccgcctacggctgcaagaaccgctacgacaaggacaagcccgtt
tctttccacaagtttcctcttactcgacccagtctttgtaaagaatgggaggcagctgtc
agaagaaaaaactttaaacccaccaagtatagcagtatttgttcagagcactttactcca
gactgctttaagagagagtgcaacaacaagttactgaaagagaatgctgtgcccacaata
tttctttgtactgagccacatgacaaggttgatgctgctattggattactaatgccgcct
cttcagacccctgttaatctctcagttttctgtgaccacaactatactgtggaggataca
atgcaccagcggaaaaggattcatcagctagaacagcaagttgaaaaactcagaaagaag
ctcaagaccgcacagcagcgatgcagaaggcaagaacggcagcttgaaaaattaaaggag
gttgttcacttccagaaagagaaagacgacgtatcagaaagaggttatgtgattctacca
aatgactactttgaaatagttgaagtaccagcataa

>gi568815590r:42737965_42943094|GENSCAN_predicted_peptide_5|510_aa
MGGSPRPASRPVREVGGSPRPASHPIQEVLSVPGPEPGPSSANAATKARSGGGSWTRGSQ
ARPPPERSAAPAASSRTDPRVARRVPSPAPAGVRPEQRRLRPGKSLSDPGPSLLAPPAPG
PGCDGTVARSARPTGRAWRVPVGGGGRGSSRRPAAPTYPSEEALAQLRPLQRLYAEHLPR
RGQAAPTAWPDTAAALSGDPGARAADSASGPPAPRIRPPTLAQRWQRLRTVTPSAPPSPF
TRDRAPAARDFGHARRPASPPPAAHTAPPQRRRPGWSRGRGATAHARAPADPRRRAAAAA
AAAAAAAGRAPATSSLRVEASTRSALEAAATGPPPPSGRALAEARQIAPRLSGVAGDGGG
ADAESGSRPGMAKYQGEVQSLKLDDDSVIEGVSDQVLVAVVVSFALIATLVYALFRNVHQ
NIHPENQELVRVLREQLQTEQDAPAATRQQFYTDMYCPICLHQASFPVETNCGHLFCELA
TATSTFSNHDQSAAINIKARLSINKKITIC

>gi568815590r:42737965_42943094|GENSCAN_predicted_CDS_5|1533_bp
atggggggcagcccccgcccggccagccgccccgtccgggaggtggggggcagcccccgc
ccggccagccaccccatccaggaggtcctttcagttcccgggccggagccgggcccgtcc
tccgcgaacgcggcaacaaaggcgaggagcggcggcgggtcctggacgcggggcagtcag
gctcggccacccccagagcgcagcgcagccccagctgcgagcagccgcacggatcccaga
gtcgcccgccgcgttcccagcccagccccggccggggtccgcccggaacagcgacgcctc
aggcccggcaagtctctctcggaccctgggccgagcctcctcgccccgcccgccccaggc
cccggatgtgacggcaccgtcgcccgcagcgctcgccccacgggccgcgcgtggagggtc
ccggtaggtggcggggggagggggtcttcccgccggcccgcggcccccacgtacccaagt
gaggaggctctcgcacagctccgcccgctccagcgactctacgctgaacatcttccccgc
cgcggccaggcagcgcctaccgcctggccagacaccgccgccgctctgtcgggggatccg
ggggcccgcgcagctgactcggcctcaggcccgcctgctcctcggatccggccgccgact
ctcgcccagcgctggcagcggctccgcaccgtcacccccagcgcccctccttctcctttc
actcgggaccgcgcgccagccgcgcgcgacttcggacacgcccgccgccccgcgtccccg
cccccggccgcccacaccgctcctccccagcgccgccgccctggctggagccgaggccga
ggagccactgcgcacgctcgcgccccggccgacccgcgccgccgcgccgccgccgccgcc
gccgccgccgccgctgccgccgggcgcgcccccgccaccagttcgctgcgtgtcgaggcg
agcacgcgctccgccctggaggctgcggcgacgggtcctcctccgccgtccggtcgcgcc
ctcgcggaagctcggcaaatcgccccgcggctgtccggagtcgccggcgacggtggaggg
gccgacgcggagagcggctctagacctggaatggccaaatatcaaggtgaagttcaaagt
ttgaaactggatgatgattcagttatagaaggagtaagcgaccaagtacttgtggcagtt
gtggtcagtttcgctttgattgctaccctggtatatgcacttttcagaaatgtacatcaa
aacattcacccagaaaaccaggagctagtaagggtacttcgagaacagcttcaaacagaa
caggatgcacctgctgccactcgacagcagttctacactgacatgtactgtcccatctgc
ctgcaccaagcctccttcccggtggagaccaactgtggacatcttttttgtgaacttgcc
acagccacctcaaccttcagcaaccatgatcagtcagcagccatcaacatcaaggcaaga
ctctccatcaacaaaaagatcaccatttgctga

>gi568815590r:42737965_42943094|GENSCAN_predicted_peptide_6|196_aa
MAILPKAIYRFNGIPIQLPLTFFKELEKTTLNFIWNQKRASIAKTILSKKNKVGGITIPD
FKLYYKATVTKTAWYWYQSRYTDQWNRTEASEIMPHIYNHLIFDKPDKNKQWEKDSLFNK
WCWENWLVICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDTGMGKDF
MTKTPKATATKAKIDK

>gi568815590r:42737965_42943094|GENSCAN_predicted_CDS_6|591_bp
atggccatcctgcccaaagcaatttatagattcaatggtatccccatacagctaccactg
actttcttcaaagaattagaaaaaactactttaaatttcatatggaaccaaaaaagagcc
agtatagccaagaccatcttaagcaaaaagaacaaagttggaggcatcacaatacctgac
ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaagcaga
tatacagaccaatggaacagaacagaggcctcagaaataatgccacacatctacaaccat
ctgatctttgacaaacctgacaaaaacaagcaatgggaaaaggattccctatttaataaa
tggtgctgggaaaactggctagtcatatgcagaaaactgaaactggaccccttccttaca
ccttatacaaaaattaactcaagatggattaaagacttaaatgtaagacctaaaaccata
aaaaccctagaagaaaacctaggcaataccattcaggacacaggcatgggcaaagacttc
atgactaaaacaccaaaagcaactgcaacaaaagccaaaattgacaaatga